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PREFACES 


Preface to the Second Edition 

When the present book was written in the early 1980s, the theory of tree automata, tree 
languages and tree transformations was young but already quite extensive. Our aim 
was to give a systematic and mathematically sound exposition of some central parts of 
this subject. The presentation uses universal algebra in the spirit of J. R. Biichi and 
J. B. Wright from whose ideas of automata as algebras tree automata once emerged. That 
the algebraic formalism encourages and supports precise definitions and rigorous proofs 
may explain why the book has remained a general reference for many mathematically 
minded workers in the field ever since its publication in 1984. Unfortunately, it has long 
been out of print and hard to obtain. 

Soon after the regrettable death of Ferenc Gecseg in October 2014, Zoltan Fiilop 
(Szeged) and Heiko Vogler (Dresden) proposed a reissue of this book. Akademiai Kiado, 
the original publisher, did not find the project feasible but gave us free hands to proceed 
on our own. Professor Gecseg’s family also willingly endorsed the idea. Since the book 
did not exist in any electronic form, the whole text had to be retyped in Latex. For this 
exacting task Fiilop and Vogler quickly assembled a highly qualified team that, besides 
themselves, included Johanna Bjorklund (Umea), Frank Drewes (Umea), Zsolt Gazdag 
(Budapest), Eija Jurvanen (Turku), Andreas Maletti (Stuttgart), George Rahonis (Thes¬ 
saloniki), Kai Salomaa (Kingston, Ontario), and Sandor Vagvolgyi (Szeged). Professors 
Fiilop and Vogler also undertook the overall management of the work. The generous 
contributions of all these individuals are acknowledged with many thanks. 

From the very beginning it was decided that this new edition should be true to the 
original one. In particular, the terminology was preserved even in cases in which some 
alternative terms have become prevailing. However, a few mistakes were corrected and 
a couple of obscure passages were clarified. 

Of course, the book was never claimed to offer a complete presentation of its subject 
matter. In fact, some important topics were totally left out. It was hoped that the ex¬ 
tensive bibliography, fairly complete up to around 1982, and the notes and references at 
the end of each chapter would, at least partly, make up for the shortcomings. Now, over 
thirty years later, the incompleteness is naturally even more obvious. Much progress has 
been made in already established areas and many new topics have emerged. Some of the 
new work is strongly motivated by applications, old or new. No book of this size could do 
justice to all these developments. Instead, we have to trust that the matters presented 
here still belong to the core of the theory and are worth studying by anyone who wants 
to work in this field. Moreover, to account for more recent contributions and lines of re¬ 
search, an appendix has been added to the book. In it several topics are briefly surveyed 



and some relevant references are given to help an interested reader get started on them. 
I thank Heiko Vogler and Zoltan Fiilop for some important additions to the bibliography. 

Turku Magnus Steinby 

August 2015 


Preface to the Original Edition 

The purpose of this book is to give a mathematically rigorous presentation of the theory 
of tree automata, recognizable forests, and tree transformations. Apart from its intrinsic 
interest this theory offers some new perspectives to various parts of mathematical lin¬ 
guistics. It has also been applied to some decision problems of logic, and it provides tools 
for syntactic pattern recognition. We have not even tried to discuss all aspects of the 
subject or any of the applications, but enough central material has been included to give 
the reader a firm basis for further studies. Being relatively new and very manyfaceted, 
the field still lacks a uniform widely accepted formalism. We have chosen the language of 
universal algebra as our vehicle of presentation. However, we have not assumed that the 
reader is familiar with universal algebra; the preparatory sections in Chapter [1] should 
make the book self-contained in this respect. On the other hand, it is natural to assume 
that anyone interested in such a book has some general mathematical training and some 
knowledge of finite automata and formal languages. 

The book consists of four chapters, a bibliography and an index. The first chapter 
contains an exposition of the necessary universal algebra and lattice theory, as well as a 
quick review of finite automata and formal languages. We also recommend some books 
on these subjects. In Chapter [2] trees, forests, tree recognizers, tree grammars, and some 
operations on forests are introduced. Several characterizations and closure properties 
of recognizable forests are presented. Chapter [3] is devoted to the connections between 
recognizable forests and context-free languages. Chapter H] deals with tree transducers 
and tree transformations. Chapters [2H1] contain some exercises. Each of these chapters 
is concluded with some historical and bibliographical comments. We also point out some 
topics not discussed in the book. We have tried to make the Bibliography as complete as 
possible. Of course, it has not always been easy to decide whether a given item should 
be included or not. 

We want to thank our colleagues and the staffs at our institutions for the good working 
atmosphere in which this book was written. Dr. Andras Adam and Professor Istvan Peak 
gave the text a careful scrutiny. We gratefully acknowledge their many remarks. We are 
also indebted to Dr. Zoltan Esik for his very helpful comments on Chapter 01 We wish to 
express our warmest thanks to Mrs. Piroska Polberth for performing very competently 
the difficult task of typing the manuscript. Also, we want to thank our wives and 
daughters for their support and for putting so gracefully up with the inconveniences 
inevitably caused by our undertaking. 

The writing of the book has involved several trips between Turku and Szeged. We 
gratefully acknowledge the financial support provided by the Academy of Pinland, the 
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Hungarian Academy of Sciences, the Janos Bolyai Mathematical Society, the University 
of Szeged, and the University of Turku. Our work was also furthered by a possibility for 
the first-named author to spend a term at the Tampere University of Technology. For 
this thanks are due Professor Timo Lepisto. 
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NOTES TO THE READER 


Within each section, there is one counter which is incremented by each of the environ¬ 
ments definition, lemma, theorem, corollary, and example. The end of a proof or an 
example is indicated by the mark □. It appears immediately after a theorem, lemma or 
corollary if this is not followed by a proof. The references to the literature are by the 
author(s) and the number with which the publication occurs in the Bibliography. In a 
few cases we refer to a book mentioned at the end of Chapter [TJ 
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1 PRELIMINARIES 


In this chapter we shall review some basic concepts and results from the theories of 
automata, formal languages, and universal algebras. It is reasonable to assume that 
a potential reader of this book already knows something about automata and formal 
languages. On the other hand, we do not presuppose any knowledge of universal algebra. 
These two assumptions suggested the styles and extents of the following seven sections. 

Section 11.11 (Sets, relations and mappings) may be skimmed through for terminology 
and notation. 

Sections O and II.,11 present the required universal algebraic concepts and results. 
These are not many, but they should be mastered well as the very basic concepts of the 
theory of tree automata are defined in terms of universal algebra. We have tried to make 
the book self-contained in this respect, but a reader who wants to pursue further the 
algebraic aspects of the theory should certainly consult one of the references on universal 
algebra. 

The lattice theory presented in Section [1.41 is less important here, and the reading of 
this section may be postponed until needed. 

Sections ll.5lll.6l and ll.7l survev some of the most essential facts about finite recognizers, 
regular languages, context-free grammars, and (generalized) sequential machines. A 
reader less familiar with these matters would do wisely to look up these subjects in some 
of the references given at the end of the chapter. 

1.1 SETS, RELATIONS AND MAPPINGS 

The set theory needed here is very elementary and most of our set theoretic notation is 
well-known. However, a few conventions should be pointed out: 

(i) A C H means that the set A is a subset of the set B. Proper inclusion is denoted 
by A C H. 

(ii) 0 denotes the empty set. 

(iii) |A| denotes the cardinality of the set A. 

(iv) The power set of a set A, i.e., the set of all subsets of A, is denoted by pA. 

(v) The union of a family (A, | i G 7) of subsets (indexed by 7) of some set is written 
as U(^* \ i ^ Similarly, n(^* | z G 7) is the intersection. 

(vi) The set {x G A | Pi{x),... ,Pfc(x)} of all elements x in A with the properties Pi, 
..., Pfc may also be written as {x | Pi(x),..., Pfc(x)} when A is understood from 
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the context. We shall use this notation in the following more general form, too. 
Suppose f{xi, ..., Xm) is an object defined in some way in terms of the objects xi, 
..., Xm- Then 

{f{xi, Xm) I Pixi, Xm)} 

is the set of all such objects constructed from objects xi, ..., Xm satisfying the 
condition P{xi, ..., Xm)- Furthermore, we use 

■ ■ ■ 1 Xm)i ••• 1 fki^l , . . . , Xm ) I T’(xi, . . . , Xm)} 

as a short form for the union 

{/l(xi, . . .,Xm) I P{xi, . . .,Xm)} U • • • U {/fc(xi, . . . , Xm) \ P{xi, . . .,Xm)}- 

(vii) If there is no danger of confusion, we may write simply a for the one-element set 
{a}. Of course, we should not write 0 for {0}. 

Sometimes we employ some notation from logic as abbreviations: 

(i) “(Vx G A) P{x)” states that P{x) holds for all x € A. 

(ii) “(3x G A) P{x)” states that there exists an x in A such that P{x) holds. 

(iii) “P Q” means that Q holds if P holds. 

(iv) “P Q” states that the conditions P and Q are equivalent, i.e., both of them 
hold or then neither one holds. 

(v) “P A Q” is the statement that both P and Q hold. Similarly, “P V Q” states that 
at least one of P and Q holds. 

The numbers dealt with here are always integers and mostly even non-negative integers. 
When we write “... for all n > 1” we mean, in fact, “... for all integers n > 1”. The 

set of all integers is denoted by Z, the set of the natural numbers 1, 2, ... by N, and 

the set of all non-negative integers by Nq. 

Let A and B be sets and g C A x B a (binary) relation from ^ to P. The fact that 
(a, b) £ Q {a G A, b G B) is also expressed by writing agb oi a = b (g). The opposite case 
may be expressed by or by a ^ 6 (g). For any a G A, we put 

ag = (b G B I agb}. 

This notation is extended to subsets of A: 

Aig = I a G Ai) for Ai C A. 

The converse of g is the relation 

g~^ = {{b,a) \ {a,b) G g} C B X A. 
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1.1 Sets, relations and mappings 


Obviously, 


bg ^ = {a ^ A \ agb} 


and 

Biq~^ = {a G a I (36 G Bi)aQh} 

for all 6 G -B and Bi C B. The domain of g is the subset dom(^j) = Bq~^ of A, and its 
range is the subset range(ip) = Aq of B. 

The product or composition of two relations g C A x B and r C B x C is the relation 


go T = {(a,c) I (36 G B)agbTc} C A x C. 


In this definition we used the short form agbrc to express the fact that agb and 6rc. 
Often we write gr for g o t. The product of relations is associative. We note also the 
equality {g o r)“^ = o g~^. 

Consider now (binary) relations on a set A, i.e. subsets oi A x A. These include the 
diagonal relation 6 a = {(«,«) | a G T} and the total relation la = A x A. For any 
relation ^ on we define the powers g^ (n > 0) with respect to the product of relations: 

1° g^ = 6 a and 
2° = g'^og for n > 0. 


The relation C x is called 

(a) reflexive if 6a C g, 

(b) symmetrie if g~^ C g, 

(c) antisymmetric if O C 6^ and 

(d) transitive if g'^ C g. 

The intersection of any reflexive relations (on a given ^4) is reflexive, and the intersec¬ 
tion of transitive relations is transitive. Thus there exists for every g A x A a unique 
minimal reflexive, transitive relation g* containing g. It is called the reflexive, transitive 
closure of g. One verihes easily that 

= 6a U U U U ..., 


i.e., for any a, b G A we have ag*b iff 


a — a\ g U2 g ^3 * * * —i g — b 

for some n > 1 and ai, On G A. 

A relation on A is called an equivalence relation on A, if it is reflexive, symmetric 
and transitive. The set of all equivalence relations on A is denoted by E{A). Clearly, 
6a G B(A) and /.a G E{A). Let g be an equivalence relation on A. The g-class (or 
the equivalence class modulo g) of an element a G ^ is the set ag. Obviously, agb iff 
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ag = bg. We shall also write a/g for ag and extend this notation to subsets Ai C A 
and n-tuples a = (ai,... , 0 ^) of elements of ^ (n > 1): Ai/g = {a/g \ a G ^ 1 } and 
a/g = {ai/g,... ,an/g)- The quotient set of A modulo g is A/g. Obviously, A/g is 
a partition on A, that is, every element of A belongs to exactly one fp-class. On the 
other hand, every partition on A can be obtained this way as the quotient set from a 
unique equivalence relation and there is a natural one-to-one correspondence between 
the partitions on A and E{A). The cardinality of A/g is called the index ol g G E{A). 
If \A/g\ is finite, we say that g is of finite index. We say that g G E{A) saturates the 
subset H Q A\i Hg = H, i.e., if H is the union of some £»-classes. 

A mapping or a function from a set A to a set S is a triple (A, B, ip), where ip ^ Ax B 
is a relation such that for every a G A there exists exactly one b G B satisfying atpb. As 
usual we write ip: A ^ B and say that (/9 is a mapping from A to B. If a(pb (a G A, 
b G B), b is called the image of a and a an inverse image of b. This is expressed by 
writing b = aip, b = ip{a) or ip: a b. For a subset Ai of A we also use the two notations 
Aiip and ip{Ai) for the set {aip \ a G Ai}. The converse ip~^ of ip is always defined as 
a relation (C i? x A), but it is usually not a mapping from B to A. Again, ip~^{Bi) 
will sometimes be used instead of Biip~^ when Bi C B. Note that dom((/?) = A and 
range((/?) C B. The set of all mappings from A to -B is denoted by B^. 

The composition or product of two mappings ip: A ^ B and if: B ^ C is the mapping 

ipif: A ^ C 

where ip'ip is the product of ip and tp as relations. Clearly, aiptp = {aip)^) for all a G A. 

The restriction of a mapping (p: A ^ B to a subset C of A is the mapping 

ip\C: C ^ B 

where ip\C = ip H {C x B). It tp: C ^ B is obtained from ip: A ^ B as the restriction 
of ip to C, i.e., C* C A and p) = ip\C, then we say also that ip is an extension of p) to A. 

The kernel ipip~^ of a mapping ip: A -G B is an equivalence relation on A and 
ai = a 2 {ipip~^) iff aiip = a 2 ip (ui, 02 G A). On the other hand, one can associate 
with every 0 G E{A) a mapping 

6*^: A—)■ A/0, a !-)■ a0, (a G A) 

such that the kernel of 0*^ is 0. This 0^ is called the natural mapping associated with 0. 

A mapping ip: A ^ B is called 

(i) injective (or an injection), if ipip~^ = 5a-, 

(ii) surjective (or a surjection), if range((/9) = B, and 

(iii) bijective (or an bijection), if it is injective and surjective. 

It ip: A ^ B is surjective, one says also that (/9 is a mapping of A onto B. It is obvious 
that the natural mapping 0^ is always surjective (0 G E{A)). The diagonal relation of a 
set A defines the identity mapping A —)■ A, a i-)- a (a G A). It is denoted by 1a- 



1.1 Sets, relations and mappings 


We shall also meet partial mappings, that is, mappings for which the image of some 
elements may be undefined. A partial mapping from A to 5 is defined by a relation 
ip C A X B such that \a(p\ < 1 for all a £ A. Again, we write ip: A ^ B. If ac/? = 0, then 
we say that ip is undefined for a {a £ A). The notations and terminology introduced 
above for mappings apply to partial mappings, too, although dom((/?) may be a proper 
subset of A when ip: A ^ B is a partial mapping. 

It is convenient to think of the elements of a cartesian product Ai x • • • x A„ as n-tuples 
(ai,..., a„) with ai G Ai, ..., a„ G A„. We adopt the definition of an ordinal number n 
as the set of all ordinals smaller that n: 0 = 0, 1 = {0}, 2 = {0,1} etc. and, in general, 
n = {0,1,... , n — 1}. Then Ai x • • • x A„ can also be defined as the set of all mappings 

ip: n —>■ Ai U • • • U An 

such that iip £ Aj_|_i for i = 0, 1, ..., n — 1. Of course, we may identify such a ip with 
the n-tuple {Dip, lip ,... , (n — !)</?). Now the cartesian power A” = A x • • • x A (n times) 
is the set of all mappings ip: n ^ A. In particular, A^ = {0} since 0 is the only mapping 
from 0 to A. Note that the notation A^ is consistent with our earlier notation B"^ for 
the set of all mappings from A to B. 

We shall also need countably infinite sequences of elements. Let w = {0,1, 2,... } be 
the smallest infinite ordinal and A any set. The elements of A^ are called uj-sequences. 
Thus an a;-sequence of elements of A is a mapping 

ip I uj —^ A 

which we may also write as 

(0(/9, lip, . . . , nip, . . . )n<u}- 

We conclude the section by considering operations. These are special mappings and 
are among the most fundamental concepts of algebra. Let m > 0. An m-ary operation 
on a set A is a mapping from A™' to A. li ip: A™' —)■ A is an m-ary operation on A, then 
ip assigns to every m-tuple (ai,... ,am) of elements of A a unique element of A which 
we write as ip{ai,... ,am). The number m is called the arity or the rank of ip. Most 
operations encountered in the usual algebraic systems (groups, rings, lattices etc.) have 
rank 0, 1 or 2. A few comments on these special cases; 

(i) A 0-ary operation ip: {0} —)■ A is completely determined by its only image ip fill), 
and often ip is given simply by naming this element. Note that here 0 may also be 
seen as the empty sequence of elements, and often one writes ip{ ), or just ip, for 
ipfib). 

(ii) When m = 1, we have a mapping from A to itself. Such operations are called 
unary. 

(iii) An operation of rank 2 is called a binary operation. For example, the addition and 
the multiplication in a ring are binary operations. In most such concrete examples 
one uses the infix notation for binary operations. Thus it is customary to write the 
ring operations in the form a + h and a-b instead of +(a, h) and -(a, b), respectively. 


9 



1 PRELIMINARIES 


A partial m-ary operation on a set A is a partial mapping from A™ to A. For any 
partial m-ary operation ip: A™ —>■ A and subset i? of A we have a partial mapping 

ip\B: B, 

where ip\B = (/? n (S™ x B). If (/? is an operation and B is closed with respect to tp, 
i.e., ip{ai ,..., am) G B whenever ai, am & B, then (p\B is an m-ary operation on B 
called the restriction of ip to B. Often the same symbol is used to denote an operation 
and its restrictions. 

Suppose we are given a set A, k m-ary operations ipi, ■ ■ ■ ,ipk on A and a A:-ary operation 
'll) on A (m, A: > 0). The composition of (pi, ipk with ip is the m-ary operation 
'iplipi ,..., ipk) defined so that 


7 ••• ■> Pk){(^l 7 ••• 7 ®m) — (®1) • • • ) (km) 7 ••• 7 Pk{^l7 ••• 7 (km)) 

for all oi, ..., Om G A. Note that the possibilities /c = 0 or m = 0 are included. If A: = 0, 
then the composition is an m-ary operation with the constant image ^^(0). If m = 0, 
then the composition is a 0-ary operation with the single value V’(</^i(0), • • • ,<^fc(0))- 
Let ip be an m-ary operation on a set A and Ai, ..., Am any subsets of A. Then we 
write 

^(Al 7 ■■■ 7 Am) — {pi(kl7 ■ ■ ■ 7 (km) I Ol £ 7 ■■■ 7 Om € Am}' 

Thus ip is extended to an m-ary operation on the power set pA. In general, there is no 
need to introduce a new notation for this extension. 


1.2 UNIVERSAL ALGEBRAS 

In this and the next section some concepts and results from universal algebra are sur¬ 
veyed. Universal algebra is an extensive field of mathematics, but we need really just 
certain basic parts of it. On the other hand, a good grasp of the material of these sections 
is essential to an understanding of the rest of the book. 

Generally speaking, an algebra (or a universal algebra) is a set together with a set of 
operations on this set. There may be a finite or an infinite number of operations, but we 
insist that they all are finitary, i.e., the ranks are finite as in the definition of operations 
given in the previous section. As a first example we consider the algebra of subsets of 
a given set U. In the power set pU we have several naturally defined operations. For 
example, there is a binary operation U that forms the union AuB of any two A, B £ pU. 
Similarly, we have the binary operation n that forms the intersection of two subsets of U. 
A unary operation is obtained if we map every A G pU to its complement A^ = U — A. 
Furthermore, we introduce two 0-ary operations, one that has 0 and one that has U as 
its image. Of course, an infinite number of operations could be defined on pU, but if we 
restrict ourselves to those defined above, we get the algebra 

{pU, u, n, ^ 0, u) 
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with two binary, one unary and two 0-ary operations. Note that we get such an algebra 
for each set U. In fact, all of these algebras can be viewed as special instances of a 
general class of algebras known as Boolean algebras. 

The example brings forth an important point. In algebra, and this will be the case 
here, too, one is generally not interested just in individual algebras, but rather in whole 
classes of algebras. Algebras in such a class are all “similar” in the sense that there 
is a natural correspondence between the operations of any two algebras of the class. 
Such a correspondence of operations is needed when one defines any concept, such as 
homomorphisms or direct products, involving more than one algebra. For example, the 
multiplications of any two groups correspond to each other, and a homomorphism of 
groups should preserve the multiplication. We shall now introduce a convenient vehicle 
to define such a class of similar algebras. 

Definition 1.2.1 An operator domain is a set S together with a mapping 

r: S —> No 

that assigns to every ct G S an arity, or rank, r{a). For any m > 0, 

Sm = {o- G S I r(cj) = m} 

is the set of the m-ary operators (or operational symbols). 

From now on S is an operator domain. The mapping r is usually not mentioned, but 
we denote by r(S) the set of all m > 0 such that 'Em 7^ 0- One can write E as the 
disjoint union Sq U Si U S 2 U ... from which the empty sets will be omitted. 

Definition 1.2.2 A E-algebra .4, is a pair consisting of a nonempty set A (of elements 
of .4.) and a mapping that assigns to every operator a G S an m-ary operation 

^A. 

where m is the arity of a. The operation cj'^ is called the realization of a in A. The 
mapping a 1 —)■ cj"^ will not be mentioned explicitly, but we write A = (A, S). The 
S-algebra A is finite if A is finite, and it is of finite type if S is finite. When S is not 
specified, or not emphasized, we speak simply about “algebras”. An algebra with just 
one element is called trivial. 

In general, A = {A, E), B = {B, S) and C = {C, S), possibly equipped with subscripts, 
will be S-algebras. The realizations of an operator cj G S in these algebras are denoted 
by and cr*', respectively. 

In the previous example of subset algebras we would have E = Sq U Si U 5]2 with 
(for example) Eq = {0,1}, Si = {-■} and S 2 = {A,V}. The algebra of the subsets of a 
set U is then the S-algebra A, where A = pU and the operators are realized as follows: 
0"^ = 0, = U, - 1 "^ = (complement in U), = Pi (intersection) and V"^ = U (union). 
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Note that the possibility m = 0 is not excluded when we consider generally an m-ary 
operation. For cj G Sq one often writes instead of cj'^( ) or (this involves the 

harmless confusion of a 0-ary operation and its value). When S = {ai,... ,ak} is finite, 
one usually writes A = {A, ai,... ,ak) instead of ^ = {A, S). 

We introduce now several concepts related to algebras. 


Definition 1.2.3 The S-algebra is a subalgebra of the S-algebra A if B C A and 
= a-^lB for all ct G S. 


If is a subalgebra of A, then B is a closed subset of A, i.e., ... ,bm) ^ B for all 

a G Era {'ITT' > 0) and bi, ..., bm & B. For every nonempty closed subset B of A, there is 
exactly one way to realize the operators on B in such a way that we get a subalgebra B 
of A: obviously every should be the restriction of the corresponding operation 

of .4 to B. Hence, a subalgebra is completely determined by its set of elements and one 
may call this subset a subalgebra. If ct is a 0-ary operator, then every subalgebra of A 
contains the element a-^. If Sq is empty, then 0 is a closed subset, but we do not count 
it among the subalgebras. 

It is easy to see that the intersection of any family of closed subsets of a given algebra A 
is again closed. Thus we have for any H C A a unique minimal closed subset containing 


H: 


[H] = p|(H \ H CBCA,B closed). 


If 7 ^ 0 or So 7 ^ 0, then [H] is also nonempty and thus a subalgebra. It is called 
the subalgebra generated by H. If Sq = 0, then [0] = 0. A generating set of .4, is a 
subset H C A such that [H] = A and A is said to be finitely generated if it has a finite 
generating set. It is clear that every finite algebra is finitely generated. 


Definition 1.2.4 A homomorphism from a S-algebra .4 to a S-algebra H is a mapping 
ip: A ^ B such that for all m > 0, u G Em and ai, ..., G A, 

cr'^(ai, ..., am)p = cr'^iaip,..., OmP). 

We write then p: A ^ B. This homomorphism is called 

(a) an epimorphism, if ip is surjective, 

(b) a monomorphism, if p is injective, and 

(c) an isomorphism, if p is bijective. 

If there exists an epimorphism from A to B, then B is said to be an epimorphic image 
of A. A monomorphism is also called an embedding. If there is an isomorphism from A 
to B, then A and B are isomorphic and we write A = B. Homomorphisms are often also 
called morphisms. 

If A = B, then A and B are the same algebra from the abstract point of view. An 
easy computation shows that the composition pfi> of two homomorphisms p: A ^ B 
and H C is a homomorphism from A to C. 
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1.2 Universal algebras 


A homomorphism is a mapping that is compatible with the operations of the algebras. 
For example, let A = (Z, +) be the algebra of the integers with the usual addition as the 
only operation, n > 1 and B = (Z„, +) the algebra where Z„ = {0,1,... , n — 1} and the 
sum is formed modulo n. Then the mapping ip: Z —)■ Zjj that maps every a € Z to its 
remainder rn(a) modulo n (0 < rn{a) < n) is an epimorphism from A to B. Of course, 
the homomorphisms defined in group theory, lattice theory etc. provide further general 
examples. 

The proof of the following lemma is straightforward and thus it is omitted. 

Lemma 1.2.5 Let p>: A^ B be a homomorphism. If C is a subalgebra of A, then Cp> 
is a subalgebra of B. If V is a subalgebra of B and D<p~^ is nonempty, then Dip~^ is a 
subalgebra of A. □ 

The following lemma contains an important observation. 

Lemma 1.2.6 Let (p: A^ B and A^ B be two homomorphisms and H a generat¬ 
ing set of A. If p>\II = 'f'\II, then pi = fj. In other words, a homomorphism is completely 
determined by its restriction to a generating set. 

Proof. Let C = {a G A | ap = atp}. Then If C C by the assumption. If m > 0, a G 
and ai, ..., am ^ C, then ..., am) G C: 

o-^(ai,..., am)p = (T^iaip,..., Omp) = cr^iaifj, ..., ami') = • • •, am)i- 

Hence C is closed and we get C = A. This implies p = i. □ 

We define now two concepts closely related to homomorphisms, namely congruences 
and quotient algebras. 

Definition 1.2.7 A congruence (relation) of A is an equivalence relation on A which is 
invariant with respect to all operations (a G S). A relation C A x A is said to be 
invariant with respect to an m-ary operation /: A™ —)■ A if 

f{ai,...,am) = f{bi,...,bm) (g) 
for all elements oi, ..., Om, bi, ..., bm ^ A such that 

Oil — bl, . . . , Um = bm (g)- 

The set of all congruences of an algebra A is denoted by C (A). 

Every algebra A has at least the trivial congruences Sa and /.a- For g G C'(A), the 
^j-class ag of an element a G A is also called a congruence class (modulo g). The partition 
A/Q ol A defined by the congruence classes is compatible in the sense that for all m > 0, 
a G Sm and aig, ..., amg H Ajg there is a class ag such that 

(T (ui^,..., amg) ^ ag. 
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Obviously, we can choose a = ... ,am)- It is also easy to see that an equivalence 

relation g G E{A) is a congruence of A only in case A/g is a compatible partition. In 
fact, in automata theory it is usual to deal with compatible partitions (also called SP 
partitions) rather than with congruences, but both concepts convey the same idea. 

The fact that A/g is a, compatible partition for any g G C{A) also justifies the following 
definition; the operations are well-defined. 

Definition 1.2.8 The quotient algebra A/g = {A/g,E) of a S-algebra .4, by a congru¬ 
ence g G C{A) is defined as follows. For any m > 0, cr G and ai, ..., am G 4. we 
put 

..., amg) = cr-^iai, ..., am)g- 

The definition of may be explained as follows. To compute a-^^^{aig,... ,amQ) 
one takes a representative from each of the ^^-classes, say oi, ..., am, computes for 
the representatives and forms then the ^>-class of the resulting element. 

Homomorphisms, congruences and quotient algebras are closely related to each other 
as the following three theorems show. 

Theorem 1.2.9 For any g G C{A), the natural mapping g^: a ag is an epimorphism 
A ^ A/g (the natural homomorphism^. 

Proof. We know that is a surjection from A to A/g so it suffices to verify that it is 
a homomorphism: for all m > 0, fi G Sm and oi, ..., am € A, 

a-^{ai,.. .,am)g^ = .. .,am)g = (T-^^^{aig,.. .,amg) = 

a'^/^{aig^,. ■ ■ ,arag^)- ° 

Theorem 1.2.10 The kernel of any homomorphism (p: A ^ B is a congruence 

of A. 

Proof. Consider any m > 0, a G Em and elements oi, ..., am, a'l, ..., a'm G 4 such 
that 

01 = 0 . 1 , . . . , am — (I'm (.TT )■ 

Then aip = a'lP, ..., amP = (('mP^ which implies cj-^(oi, ..., am)p = o'^iaiP, ■ ■ ■, a-mP) = 
a^{a[p ,..., amP) = o"^ia'i ,..., a'm)p- This means that a“^{ai ,..., am) = ■ ■ ■, dm) 

{pp~^) as required. □ 

Theorem 1.2.11 Every epimorphic image of an algebra A is isomorphic to some quo¬ 
tient algebra of A. 

Proof. Let p: A ^ B he an. epimorphism and 9 = pp~^ its kernel. We claim that 
B = A/9. The required isomorphism A/9 B is shown to be given by 

'i/: a9 ap {a £ A). 
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For any oi, 02 G A, 

aiOip = O 20 V’ iff 

iff ai = 02 (9). 

This shows that V' is well-defined (i.e., is independent of the choice of the represen¬ 
tative a ^ A oi the 0-class a9) and injective. Since ip is surjective, it is clear that if: is 
surjective, too. It remains to be shown that "0 is a homomorphism. Let m > 0, o € 
and oi, ..., am & A. Then 

..., amO)^) = o"^(ai,..., am)9i^ 

= o"^(ai,.. .,am)p 
= o^(aiV9,.. .,amp) 

= a'^{ai9'ip,... ,am9'ip). □ 

Taken together, Theorems ll.2.9l and ll.2.lT] sav that the epimorphic images of an algebra 
are exactly its qnotient algebras (when one does not distinguish between isomorphic 
algebras). 

Next, direct products of algebras are introduced. We may restrict ourselves to the case 
of a finite number of factors. 

Definition 1 . 2.12 The direct product of two S-algebras A and B is the S-algebra 

AxB={AxB,J:), 

where the operations are defined so that 

(J ((fli, 61 ), . . . , (Htti, hm)) — (u (di, . . . , Om)) CT ( 6 ]^, . . . , bmY) 

for all m > 0, (T G and (ai, 6 i), ..., (om, bm) ^ A x B. The [k > 0) direct power 
A^ of the S-algebra A is defined inductively: 

(i) A^ = ({ 0 },S) is the trivial S-algebra. 

(ii) A^~^^ = A^ X A for all k >0. 

It is easy to see that direct products are associative in the sense that {A x B) x C = 
Ax (B X C) for all A, B and C. Both of these products can be written simply as 
A X B X C and their elements may be identified with the triples (a, b, c) with a G A, 
b £ B and c £ C. More generally, one can define the direct product Mi x • • • x Ak of k 
{k > 0 ) S-algebras as an algebra with Mi x • • • x as its set of elements and operations 
performed componentwise. It is easy to see that the projections 

TTj: Ml X • • • X Mfc Mj, (di,..., dfc) 1-^ di 

{i = 1,. ■ ■ ,k) are epimorphisms from Mi x • • • x Ak to the respective factor algebras M,. 
Hence, every factor in a direct product is an epimorphic image of the direct product. 

We shall also need the following, perhaps, less usual, way to construct a new algebra 
from a given one. 
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Definition 1.2.13 The subset algebra (or power algebra) pA = (pyl, S) of a S-algebra 
A is defined as follows. If m > 0, cr € and Hi, ..., Hm G pA, then put 

Note that the singleton sets {a} (a G A) form in p.4. a subalgebra isomorphic to A. If 
So = 0, pA has the trivial subalgebra {0}. 

We conclude this section with a simple example illustrating these constructions. 

Example 1.2.14 Suppose S consists of one binary operator a and a nullary operator 
7 . Let A = ({a, 6 },S) be a S-algebra such that = a and (T'^(a,a) = a-^{a,b) = 
a^{b, a) = a, o"^{b, b) = b. Consider first the direct power .4.^ = .A x .4. If we write aa 
for (a, a) etc., then = aa and is given by the following multiplication table: 



aa 

ab 

ba 

bb 

aa 

aa 

aa 

aa 

aa 

ab 

aa 

ab 

aa 

ab 

ba 

aa 

aa 

ba 

ba 

bb 

aa 

ab 

ba 

bb 


Let us now construct the subset algebra. The value of the 0-ary operation is = {a} 
and the operation is given by the table below. 



0 

{«} 

{b} 

{a,b} 

0 

0 

0 

0 

0 

{a} 

0 

{«} 

{«} 

{a} 

{b} 

0 

{«} 


{a, 6 } 

{a,b} 

0 

{«} 

{a,b} 

{a, 6 } 


1.3 TERMS, POLYNOMIAL FUNCTIONS AND FREE 
ALGEBRAS 

The concepts “term” and “polynomial function” are all-important in our modelling of 
the theory of tree automata. Let us consider an introductory example. An expression 
like (x + y){y A z), such expressions are called terms, represents in a natural manner a 
function of the three variables x, y, and z. Two things should be pointed out here. First 
of all, the term defines such a function in any algebra with operations denoted by the 
operators appearing in the term. In our case it could define, for example, a mapping 
—7> Z or a mapping ^ R depending on whether the addition and multiplication 
are interpreted as those of integers or those of real numbers. Generally speaking, the 
terms are determined by the operator domain, but they define operations in all algebras 
with that operator domain. Secondly, we note that the term not only defines a function, 
but it also describes a way to compute its values from the values of the variables once 
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the operations of the algebra in question are known. In fact, algebras can be viewed as 
devices that evaluate terms. When we interpret (in Chapter [2|) terms as trees, the step 
from algebras to tree automata is not long. 

From now on, X will be a set disjoint from the operator domain S. The elements of 
X are called variables. Other symbols used for sets of variables are Y and Z. 

Definition 1.3.1 The set Fj]{X) of T-terms in X, or YX-terms for short, is dehned as 
follows: 

(i) X c Fe(X), 

(ii) cr{ti ,..., tm) € Ts(X) whenever m > 0, a G Ym and ti, ..., tm G ^^(X), and 

(iii) every EX-term can be obtained by applying the rules (i) and (ii) a hnite number 
of times. 

If fj is a 0-ary operator, then we get by rule (ii) the SX-term a{ ). It is convenient 
to write just a for such a term. Then the definition of Fs(X) may be reformulated as 
follows. 

Definition 1.3.1’ The set Fs(X) of SX-terms is defined as follows: 

(i) XUEoCFs(X), 

(ii) cr{ti,... ,tm) G Fs(X) whenever m > 0, a G Ym and ti, ..., tm G ^^(X), and 

(iii) every EX-term can be obtained by applying the rules (i) and (ii) a hnite number 
of times. 

When E and X are unspecihed or unemphasized, we shall speak simply about terms. 
The inductive dehnition of Fy:{X) suggests a useful method to deal with terms. It could 
be called term induction. If we want to dehne a property or quantity c(t) for every 
EX-term t, it suffices 

(i) to dehne c{t) for all t G X, and then 

(ii) to give a rule how to determine c{cr{ti,... ,tm)) in terms of a {g Ym) and c(ti), 
..., c{tm) {m > 0). 

Sometimes the variation suggested by Dehnition 11.3.11 is more convenient: in (i) one 
dehnes c{t) for t G Yq, too, but in (ii) one can then restrict oneself to values m > 0. 
Proofs by term induction can be modelled according to the same pattern. 

Note that Fs(X) is empty iff Eq = X = 0. Since we do not want to consider this 
uninteresting case separately every time, we shall tacitly assume that always EqUX 7 ^ 0. 

Example 1.3.2 Let E = Eq U Ei U E 2 , where Eg = {p}, Ei = {r} and E 2 = {o'}. 
If X = {x,y,z}, then x, z, p, t{z), t{p), a{z,T{p)) and t = a{x,a{z,T{p))) are some 
examples of EX-terms. □ 
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A SX-term t is evaluated in a given S-algebra as follows. First we assign a value 
xa G A to every variable x G X. Then the operations of A are applied to these elements 
as indicated by the form of t. For example, given a mapping a: X A, the t of the 
previous example would yield the element 

a'^{za, 

Of course, the result depends on the choice of a, too. This evaluation process can be 
formalized as follows. 

Definition 1.3.3 With every S-algebra A and SX-term t we associate a mapping 

t-^: A^ ^ A 


as follows: for any a: X ^ A 

(i) x'^(a) = xa {x G X) and 

(ii) t-^{a) = when t = a{ti, {m > 0, a G Dm, h, ..., 

tm G The mappings are called the polynomial functions of A in vari¬ 

ables X and their set is denoted by Px{A). 

It may seem strange that the polynomial functions G Px{A) are evaluated on 
mappings from X to A, but this is, in fact, just a modification of the usual way to 
express polynomial functions. When one writes the value of a polynomial function in 
the form p{ai ,..., o^), a given order of the variables is assumed, say X = {xi,..., x^}, 
and the n-tuple (oi,... ,an) is just a convenient way to give the mapping a \ X ^ A 
such that Xia = 0 * (f = 1, ..., n). 

In a sense, the polynomial functions of an algebra A are the operations one can derive 
by composition from the basic operations (a G S) of A, and they share many 
properties with these. This is exemplified by the following four lemmas. 

Lemma 1.3.4 If B is a subalgebra of the E-algebra A and a: X ^ A a mapping such 
that Xa C B, then t'^{a) G B for all t G Fj^{X). □ 

The lemma states, in other words, that subalgebras are closed with respect to polyno¬ 
mial functions. The proof is a simple exercise in term induction quite similar to that of 
the next lemma which expresses formally the fact that congruences are invariant with 
respect to polynomial functions. 

Lemma 1.3.5 Let 6 be a congruence of the E-algebra A and a: X ^ A, P: X ^ A 
two mappings such that 

xa = x/3 (6) for all x G X. 

Then t-^{a) = {9) for all t G Fj^{X). 
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Proof. We proceed by term induction on t. 111 = x tz X, then 

t'^{a) = xa = xfl = {9). 


Let t = cr{ti ,..., tm) and suppose 

t'f{a) = tf{l3) {6) for alH = 1, ..., m. 

Then also 

t^{a) = t^{a)) = , t^i/3)) = f^{/3) {6) 


as 0 is a congruence. Here the possibility m = 0 can be allowed as a trivial special case. 

□ 


Lemma 1.3.6 Let ip: A ^ B be a homomorphism of T-algebras. Then 

t'^{a)p = t^(ap) 

for each mapping a: X ^ A and each TX-term t. □ 

Lemma 1.3.7 Let A and B he T-algebras, and a: X ^ A and /3: X ^ B any map¬ 
pings. If we define a mapping y: X ^ A x B by putting 

xj = {xa, x/3) for all x € X, 

then 

= (t-^(a),t®(/3)) for all t € F^{X). □ 

Lemmas 11.3.61 and 11.3.71 can easily be verified by term induction. 

The subalgebra generated by a subset can also be described in terms of polynomial 
functions. 

Lemma 1.3.8 For any subset X of a T-algebra A we have [X] = {t'^{ax) \ t € Fj:{X)}, 
where ax = 1a\X, i.e., ax is the mapping from X to A such that xax = x for all x € X. 

Proof. Denote {t-^{ax) \ t G by C. For every x & X, x = xax = x-^{ax) G C. 

Hence X C C. Also, C is closed under the operations of A: 


a^{tf{ax)-,---Am{ax)) = a{ti, ... ,tm)'^{ax) G C 

for all m > 0, (T G and ti, ..., tm H Fy,{X). Lemma 11.3.41 implies that C C B for 
every subalgebra B which contains X. Hence C = [X]. Note that the result is true even 
if So = X = 0. In this case [X] = 0. □ 

We shall now turn to the S-algebra formed by the SX-terms. 
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Definition 1.3.9 The S-algebra = (Ts(^),S) defined so that 

=a{ti,...,trn) 

for all m > 0, cr G and ti, ..., tm ^ F^{X), is called the EX-term algebra or the 
free E-algebra generated by X. 

We shall first account for the name “free algebra”. 

Definition 1.3.10 Let iL be a class of S-algebras. A S-algebra J- = {F,E) is said to 
be freely generated over AT by a subset A C F, if the following conditions are satisfied: 

(i) F G AT. 

(ii) X generates F. 

(iii) Every mapping a: X A oi X into any algebra .4, in AT has an extension to a 
homomorphism a: F ^ A. 

If these conditions are satisfied for some subset X of F, then F is called a free algebra 
over K (with |A| generators), and X is called a free generating set. 

A well-known example is provided by the free semigroup X~^ generated by a set (al¬ 
phabet) X. The elements of X'^ are all the finite nonempty strings of elements of X. 
The product of two such strings u and v is simply their concatenation uv. The associa¬ 
tivity of this product is obvious and thus X~^ is a semigroup. As every string u G X~^ 
is obtained by concatenating individual elements of A, it is clear that A generates A"*". 
To prove that A"*" is freely generated by A over the class of all semigroups we consider 
any semigroup S and mapping a: A —)■ A. The required (unique) homomorphism 

a: X+ ^ S 


is obtained by putting 

{xiX2 ■ ■ ■ Xk)di = {xio) ■ {x2a) • ... • {xko) 

for all xiX 2 ■ ■ - Xk G A+ (products to the right are formed in 5). 

Free semigroups are considered later again, but we return now to our term algebras. 

Theorem 1.3.11 The EX-term algebra Fs(A) is freely generated by X over the class 
of all E-algebras. 

Proof. That A generates Fs(A) is quite obvious when we compare the definitions of 
Fj]{X) and Fs(A), but it follows also from the useful observation that 

(ax) = t for all t G F^{X) {*) 
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(where ax = Ifs(x)I^)- The proof of (*) goes again by term induction. Let A be any 
S-algebra and a: X ^ A any mapping. We claim that the mapping 

a:Fj:(X}^A, t f^(a) (t e Fj:(X)) 

is the required homomorphism. For every x €z X, xd = x'^(a) = xa. Hence, d\X = a. 
It remains to be verihed that S is a homomorphism. Indeed, 

■ ■ ■,tm)d = (T{ti, . . .,tm)-^{a) 

= .. .,tmd) 

for all m > 0, CT € and ti, ..., tm € Fj^{X). □ 

We add a few general comments on free algebras. First of all, one should note that 
the homomorphic extension d: F ^ ^ of a mapping a: X ^ A [A ^ K) is unique. 
This follows from Lemma 11.2.61 Free algebras over a given class do not always exist, 
but when they do, they are determined up to isomorphism by the cardinality of the free 
generating set. This is stated formally in the following lemma. 

Lemma 1.3.12 Any two algebras freely generated over the same class of algebras by 
sets of the same cardinality are isomorphic. 

Proof. Suppose A and B both are free over the same class K and that they have free 
generating sets X and Y, respectively, such that |X| = |y|. Then there is a bijection 
a: X Y . The converse of it, /3 = a~^, defines a bijection from Y to X. Now there 
exist morphisms 

d: A^ B and (3-. B ^ A 
such that d\X = a and /3|y = /3. But then 

a/3 ■. A^ A and fid: B ^ B 

are homomorphisms such that dfl\X = lx and l3d\Y = ly. This means by Lemma fl.2.61 
that a/3 = 1a and fid = 1b- Hence, d and fl are isomorphisms inverse to each other. 
This implies A = B. □ 

Lemma 11.3.121 allows us to speak about the algebra freely generated over a class K by 
a set X. 

We shall hx the notation d used above for the rest of the book: for any .4. and a: X—^A, 
d: Fj](X) —>■ .4 is the homomorphism such that d\X = a. To evaluate a SX-term t in a 
S-algebra A for a given assignment a: X —>■ 4 of values to the variables amounts to the 
computation of td. Indeed, we showed in the proof of Theorem 11.3.111 that t'^(a) = td 
for all A, a and t. 

The polynomial functions in variables X of an algebra A are the mappings one can get 
from the “projections” (x G X) by iterated compositions with the basic operations 
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(cr € S). If the generating set of functions is enlarged by the set of all constant 
mappings (c G A) 

7c: A, a^c (a&A^), 

then we get, in general, a larger class of functions. These are called algebraic functions. 
We shall need just the unary (i.e., 1-place) algebraic functions and these only are defined 
below. In this special case X is a singleton {x} and we may identify any mapping 
a: X ^ A with the element xa G A. Then the unary algebraic functions can be defined 
simply as certain mappings from A to A. 

Definition 1.3.13 The set of unary algebraic functions Alg^(^) of a S-algebra A is 
defined as follows: 

(i) 1a S Alg]^(.4). 

(ii) For every c G T, Alg]^(.A) contains the constant mapping 7 c: A —A, a :->■ c 
(a G A). 

(iii) The composition cr"^(/i,..., fm) is in Alg;^(A) whenever m > 0, a € and /i, 
■ ■ ■, fm ^ Alg;^(A). 

(iv) All members of Alg;^(A) are obtained by the rules (i)-(iii). 

The constant mapping jc (c G A) is usually denoted simply by c. It is intuitively 
clear from Definition II. 3.131 that every / G Alg^(A) can be represented by an expression 
similar to the terms that gave the polynomial functions. Let X = A U {x} (x ^ A). 
Following the inductive form of Definition 1 1.3.1 3l we associate with every / G Alg^(A) a 
SX-term tf as follows: 

(i) = X. 

(ii) tc = c for all c (= 7 c) (c G A). 

(iii) If / = cr-^(/i,... ,/m), then tf = cr{tf^,... ,tf^). 

It is now an easy task to verify that the following lemma holds. 

Lemma 1.3.14 For every f G Alg;^(A) there exists a term tf G Fy,{A U x) sueh that, 
for all a & A, 

/(«) = tfitta) 

when Ua is the mapping sueh that aa\A = 1 a and xaa = a. □ 

The assignment Oa depends on o G A only. We may think of tf as a SX-term for a 
suitable X, in which all variables, save x, have been assigned constant values from A. 
In other words, the unary algebraic functions are obtained from polynomial functions 
by fixing the values of some variables. It is now obvious, in view of Lemma 11.3.51 that 
congruences of A are invariant with respect to unary algebraic functions. The converse 
of this observation holds also. In fact, it can be stated in a stronger form in terms of the 
special unary algebraic functions introduced in the following definition. 
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Definition 1.3.15 A mapping /: A —)■ A is called an elementary translation of the S- 
algebra A, if there exist an m > 0, a ct G a j (1 < j < m) and elements ci, ..., 
Cj_i, Cj+i, ..., Cm G A such that 

/(a) = cT-^(ci,... ,Cj_i,a,Cj+i,... ,Cm) for all a G A. 

The set of all elementary translations of A is denoted by ET(A). 

It is obvious that ET(A) C Alg;^(A). 

Lemma 1.3.16 An equivalence relation 6 G E{A) is a congruence of A iff 6 is invariant 
with respect to all elementary translations of A. 

Proof. Suppose a = b{9) implies /(a) = f{b){9) for all a, b ^ A and / G ET(A). 
Consider any m > 0, a G Sm and elements oi, ..., Om, bi, ..., 6m £ A such that 
m = 6i, ..., Om = 6m (9). Define the following m elementary translations: 

/i(0 = (T-^{bi,... ,bj-iA,aj+i,--- ,am) {j = 

Then 

cr'^(ai,a2,... ,am) = /i(ai) = /i(^i) (0) 

= /2(a2) = / 2 ( 62 ) (0) 


— (^) 

= 0 -^( 6 i, 62 ,.--,&m)- 

Hence cj-^(ai,..., am)9a-^{bi,..., bm) and we have verified that 9 G C{A). The converse 
is obvious. □ 

1.4 LATTICES 

We shall need a few facts from lattice theory, and these are quickly surveyed here. 

Definition 1.4.1 Let A be a set. A relation C A x A is called a partial ordering of 
A, if 

(1) is reflexive), 

(2) qH q~^ C 5a {q is antisymmetric), and 

(3) QQ ^ Q {q is transitive). 

If is a partial ordering of A, then (A, q) is called a poset. 
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The usual symbol for a partial ordering is <. Often a set A is called a poset when a 
certain partial ordering of A is understood. 

An example of a poset is (pS*, C), where 5 is a set and C the usual subset relation 
in the power set pS. Another simple example is (N, <) where < is the “less than or 
equal” -relation of natural numbers. This < is a total ordering, which means that any 
two elements of the poset are comparable, i.e., either a < b or b < a holds for any two 
elements a and b. A poset {A, <) in which < is a total ordering is called a chain. 

Let {A, <) be a poset and a, b € A. We may write a > b when b < a, a < b when 
a < b and a ^ b, and a > b when a > b and a ^ b. Clearly > is a partial ordering and 
the poset (A, >) is said to be dual to (A,<). Each one of the relations >, < and > 
determines < completely. 

An element a € A is an upper bound of a subset H C A if 6 < a for all 6 G Lf. An 
upper bound a of Lf C A is the least upper bound, or the supremum, of iL, if a < c 
for all upper bounds c of H. Lower bounds and greatest lower bounds (infimums) are 
defined similarly. The least upper bound and the greatest lower bound of a subset H 
are denoted, respectively, hy \/ H and /\ H. In case of an indexed family (a* | i G I) of 
elements the notations \/{oi \ i G I) and /\{ai \ i G I) may be used. 

An element c G A is a zero element of the poset A if c < a for every a G A. If a 
poset has a zero element, it is unique and usually it is denoted by 0. Similarly, the unit 
element 1, is dehned by the condition that a < 1 for all a G A. Clearly, /\ A exists iff the 
poset has a zero element 0, and then /\ A = 0. Similarly, \J A exists, and then equals 1, 
iff A has a unit element 1. 

Definition 1.4.2 A poset (A, <) is a lattice, if \J{a, b} and /\{a, b} exist for all a, b G A. 
It is a complete lattice, ii\/ H and /\ H exist for all subsets H of A. 

In a lattice one usually writes aV6 and a Aft for \/ {a, ft} and /\{a, ft}, respectively. The 
element a V ft is also called the join of a and ft, and a A ft is the meet of a and ft. It is easy 
to see that V H and /\ H exist for every finite, nonempty subset LI of a lattice. However, 
V 0 exists only in case the lattice has a zero element 0. Then \/ 0 = 0. Similarly, /\ 0 
exists iff the lattice has a unit element 1; then /\ 0 = 1. 

The following lemma follows directly from the dehnitions of the join and the meet. 

Lemma 1.4.3 If (A, <) is a lattice then A and V satisfy the following identities: 

(LI) X A X = X, xW X = X (idempotence). 

(L2) X Ay = y Ax, x M y = y y x (commutativity). 

(L3) X A {y A z) = {x A y) A z, x y {y y z) = {x y y) y z (associativity). 

(L4) X A {x y y) = X, X y {x A y) = X (absorption). □ 

The identities (L1)-(L4) are characteristic of lattices in the following sense. If (A, A, V) 
is an algebra with two binary operations that satisfy these identities, then (A, <) is a 
lattice when < is dehned so that 

a<b iff a Aft = a (a,feGA). 
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In this lattice \/{a, b} = a \/ b and f\{a, 6} = a A 6 for all a, b G A. In lattice theory 
lattices are usually defined and considered in parallel both as posets and as algebras. 
The two aspects of the theory complement each other. 

The following lemma is often useful when one wants to show that a certain poset is a 
complete lattice. 

Lemma 1.4.4 A poset {A, <) is a complete lattice, if f\H exists for each subset H C A. 

□ 

Note that the existence of /\ 0 = 1 should also be ascertained when Lemma 11.4.41 is 
used. We shall now apply the lemma to an important example. Let vl be a set. It is 
easy to see that the intersection n(^i I * € /) of any equivalence relations Si {i G I) of A 
is again in E{A). This means that 

/\{£i \ i ^ I) — I * € 

always exists in the poset {E{A), C). (In particular, /\^ = la-) Hence, we get 

Lemma 1.4.5 For each set A, {E{A),G) is a complete lattice. □ 

In general, the union of equivalence relations is not an equivalence relation. For any 
H C E{A), \J H is the intersection of all equivalence relations which contain the union 
U LI. A more useful description of the supremum is given in the following lemma. 

Lemma 1.4.6 Let H C E[A) and a, b G A. Then a = b(\J H) iff there exist an n >0, 
El, ..., £n G H and ai, ..., a^-i G A such that 


a E\ a\ E 2 a ‘2 ... a^i — b. 


□ 


The lemma may be used to prove the following important fact. 

Theorem 1.4.7 For any algebra A = (A, S), C{A) forms a complete sublattice of 
{E{A), C), that is to say, \J F4 G C{A) and /\H G C{A) whenever FI C C{A). □ 

The direct product {Li x • • • x <) of posets (Li, <), ..., {L^, <) is a poset when 
we define < in Li x • • • x so that 

(ai,...,an) < (6i,...,6n) iff ai<bi for alH = 1, ..., n. 

If the {Li, <)’s are lattices, then the direct product is also a lattice in which 

(cii,..., aff) V {bi, ... , bn) — (oi V 6i, ..., an V bn) 

and 


(ai,... ,an) A (5i,...,5^) = (oi A 6i,...,A 6„). 

An ideal of a lattice {A, <) is a nonempty subset / of A such that, for all a, b G A, 
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(1) a, b G I implies aV b G I, and 

(2) a < b G I implies a G I. 

A dual ideal of a lattice (A, <) is a nonempty subset D oi A such that, for all a, b G A, 
(!’) a, b G D implies a Ab G D, and 
(2’) a> b G D implies a G D. 

General examples are provided by the 

(i) principal ideal {a\ = {x G A \ x < al} generated by an element a G A, and by the 

(ii) principal dual ideal [a) = {x G A | x > a} generated by an element a G A. 

Let A and B be posets. A mapping ip: A ^ B is said to be isotone, if 

(Vai, 02 G A) oi < 02 aip < a 2 P- 

Suppose now that A and B are complete lattices. The mapping p is u-continuous, if 

\/{ai\i> R)p = \l {aip \ i > 0) 
for every ascending cj-sequence 

of elements o^ G A (0 < i < cu). An cj-continuous mapping is always isotone, but the 
converse is false. 

Let A be a poset and (/?: A —)■ A a mapping. An element o G A is a fixed-point of 
p, if ap = a. It is the least fixed-point of p, if all other fixed-points of p are above 
it. Of course, there can be at most one least fixed-point. A well-known theorem by 
A. Tarski states that every isotone mapping in a complete lattice has a fixed-point. For 
cj-continuous mappings the following stronger result holds. 

Theorem 1.4.8 Let (A,<) he a complete lattice and p: A^ A an ui-continuous map¬ 
ping. Then 

[p\ = y (0v9* I i > 0) 

is the least fixed-point of p. 

Proof. Since p is isotone, 0 < 0(/5 implies 

0 < 0(^ < < .... 

By w-continuity, we get now 

[p\p = \Jfi)p'^+^ I i > 0) = \/(0v?* I i > 0) = [p\. 

For any fixed-point a oi p, D < a implies 

Up < ap = a, 

and in general by induction on i > 0, 0(/3* < a. Hence \p\ < a, and [p] is the least 
fixed-point of p. □ 
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1.5 FINITE RECOGNIZERS AND REGULAR LANGUAGES 

In this section several basic concepts and facts from the theory of finite automata are 
reviewed. For many readers there is probably nothing really new. The presentation 
is quite telegraphic and proofs are sketched at most. Much of the material will be 
generalized to tree automata in Chapter [51 and the present section is intended mainly 
as an outline of the proper background scenery. 

An alphabet is a finite nonempty set of symbols which are called letters. We shall 
usually use the letters X, Y and Z to indicate alphabets. A finite string of letters from 
an alphabet X is called an X-word or a word over X. Consider an arbitrary X-word 

W = XiX 2 . . .Xn (n > 0, Xi, ..., x„ € X). 

Here Xi = xj is possible even for i ^ j. If n = 0, then w is the empty word which is 
denoted by e. The length of w is n and we write it \w\. Obviously, |tc| = 0 iff tc = e. The 
set of all A-words is denoted by X*, and the set of all nonempty A-words is denoted 
by A"*“. The letters of an alphabet are viewed as indivisible symbols. This means, in 
particular, that for any m > 0, n > 0 and xi,..., Xm, Hi, ■ ■ ■ ,yn ^ X, 

XiX2 ...Xm= 2/12/2 • • • 2/n 

holds just in case m = n and Xj = yi for all i = 1,..., m. Letters are considered words of 
length 1. Hence, we may write A C A'*' C X* and A* = A"*" U e. 

In Section 3 we noted that A"*" is the free semigroup generated by A, when the product 
of two words is defined to be their catenation. Similarly, X* is the free monoid generated 
by A. The identity element is the empty word: ew = we = w for each w G A*. 

A language over A, or an X-language, is simply a subset of A*. An A-language is 
e-free if it does not include the empty word. Of course, formal language theory concerns 
itself with such languages only that can be specified in some effective manner. 

A family of languages L is defined by indicating for each alphabet the set /2(A) of 
A-languages belonging to the family. For example, /2(A) could consist of all languages 
recognized by automata of a given type with input alphabet A. If L G /2(A), one may 
write just L G C. Two families of languages K. and /2 are equal, which we write K, = C, 
if A2(A) = /2(A) for every alphabet A. Similarly, the inclusion K, Q C means that 
/C(A) C /2(A) for every A. 

One way to specify a language L C A* is to give an automaton that can examine any 
given A-word and then tell whether the word is in L or not. Such automata are called 
recognizers. The most basic type of recognizers is the following: 

Definition 1.5.1 An X-recognizer (also called a Rabin-Scott recognizer) A consists of 

(1) a finite (nonvoid) set A of states, 

(2) the input alphabet X, 

(3) a next-state function <5: A x A —> A, 
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(4) an initial state qq G A, and 

(5) a set A' 'L A oi final states. 

We write A = (A, X, 6, oq, A'). 

If the A-recognizer A of Definition 11.5.11 is in state a (g A) and receives the input 
X (g A), it enters state S(a,x) and remains in this state until it reads the next input 
letter. The next-state function is extended to a function 

<5: A X A* ^ A 

as follows: 

1° 6(a, e) = a for each a G A, and 

2° S(a,wx) = S(6(a,w),x) for all a G A, tc G X* and x G X. 

We will omit the cap from 6. For any a G A and w G X*, 6(a, w) is the state of A when 
it has read the whole input word w, from left to right, and the state in the beginning 
was a. As a language recognizer A operates as follows. The word w to be tested for 
membership is entered to A so that the state of A initially is ao- Now w is accepted 
by A if 6{ao,w) is a final state. Otherwise w is said to be rejected by A. The language 
recognized by A consists of all A-words accepted by A, i.e., it is the A-language 

T(A) = {rc G X* I 6{ao,w) G A'}. 

An A-language L is called recognizable, if there exists an A-recognizer A such that 
L = L{A). The family of recognizable languages is denoted by Rec, and Rec A denotes 
the set of all recognizable A-languages. 

In the definition of A-recognizers the finiteness of the state set is essential. Otherwise, 
every A-language would be recognizable. 

We shall now prepare for the first of the many characterizations of recognizable lan¬ 
guages. 

The product of two A-languages U and V is the A-language 

UV = {uvIuGU,vG V}. 


The product is associative: 

U{VW) = {UV)W for all U,V,W C X*. 

Furthermore, 

U$ = m = 9 and U{e} = {e}U = U 
for every A-language U. 

The powers (n > 0) of an A-language U are defined inductively: 

1° [/O = {e} and 

2° = U'^-^U for n > 0. 
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By means of the powers we may define the iteration of U 

U* = \J{U^ I n > 0). 

Excluding U^, we get the language 

U+ = \J{U^ I n > 1). 

Clearly, U* = U~^ U {e}, and = C/* iff e G [/. A word w G X* belongs to U* iff it 
can be expressed in the form w = uiU 2 ■ ■ ■ Un, where n > 0 and tti,.. . G 17. 

Note that X” is the set of all X-words of length n (n > 0) and the set X* of all 
X-words really is the iteration of X (when X is viewed as the set of X-words of length 
!)• 

Union, product and iteration are called the regular language operations. 

Definition 1.5.2 The set Reg X of regular X-languages is the smallest set R such that 

1° 0 G i? and {x} G R for each x G X, and 
2° U,V eR implies UUV, UV, U* G R. 

Regular languages are also called rational languages. All finite languages are regular. 
Hence Reg X is the smallest set of X-languages containing the finite X-languages which 
is closed under the three regular operations. 

The form of Dehnition 11.5.21 implies that every regular X-language can be represented 
by a regular expression which shows how the language is obtained from 0 and the lan¬ 
guages {x} by forming unions, products and iterations. 

Example 1.5.3 Let X = {x,y}. Some members of Reg X are 0, {x}, {y}, {xy} = 
{x}{7/}, {xy,yy} = {x}{y} U { 2 /}{ 2 /} = ({x} U {y}){y} and 

U = {x^y^ I * > 1, j > 0} U {yx^^ | A: > 0}. 

A possible regular expression for the language U would he y = (x(x)*(y)*) -|- (y(xx)*) 
(usually ‘-I-’ is used for union). If we agree on the usual hierarchy of regular operations 
(first iterations, then products, and unions last), then some parentheses can be omitted 
and y becomes xx*y*+y{xx)*. The language U is recognized by the X-recognizer defined 
by the state graph of Fig ll.ll (the initial state is oq and the final states are a, b and c). 

□ 

The following theorem is one of the cornerstones of finite automaton theory. 

Theorem 1.5.4 (S. C. Kleene 1956) Rec = Reg. □ 

The theorem is effective in the following sense. There are algorithms to construct a 
recognizer for any regular language given by a regular expression. Conversely, a regular 
expression representing L(A) can be found for any given recognizer A. 

Kleene’s theorem implies also that the family Rec is closed under the regular opera¬ 
tions. We shall present some more closure properties of the family Rec. 
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Theorem 1.5.5 Let X and Y be arbitrary alphabets. 

(a) IfU,V e Rec X, then UnV, U-V eRecX. 

(b) If U is a recognizable X-language, then so is its mirror image (or reversal^ 

mi{U) = {xn ... X 2 X 1 I n > 0, X 1 X 2 .. .Xn ^ U (xi G X)}. 

(c) If U and V are recognizable X-languages, then so are the quotient languages 

U~^V = {re G X* \ uw = v for some u gU,v gV} 

and 

UV~^ = {w G X* I wv = u for some u GLf,v gV}. 

(d) Let ip: X* Y* be a homomorphism (of monoids). If U G Rec X, then Up G 
Rec Y. IfV G Rec Y, then Vp-^ G Rec X. 

(e) IfUG Rec X and p: pX* pX* is such a substitution mapping that xp G Rec Y 

for all X G X, then Up G Rec Y. □ 

Recall that a mapping p: pX* —> pX* is a substitution, if 

1° {e}p = {e}, 

2° {wx}p = {wp){xp) for all w G X*, x G X, and 
3° Up = D{up \uGU) for all U C X*. 

Obviously, the substitution is completely defined when the languages xp {x G X) are 
given. Extended to mappings of languages, homomorphisms p: X* —)■ X* are special 
substitutions for which every xp {x G X) consists of exactly one word. 

Often it is convenient to allow a recognizer to be nondeterministic. In a nondeterministic 
X -recognizer A = {A,X,5, Aq, A') the next-state function is a mapping 

5: Ax X ^pA. 

Also, the recognizer has a set Aq C A oi initial states. If A receives in state a the input 
letter x, then it may enter any one of the states in S{a,x). The operation of A may be 
started in any initial state oq G Aq. A word w = X1X2 .. .Xn (n > 0,xi,..., G X) is 
accepted by A if there is such a choice of states oq, oi,..., that 
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(i) 

Q-O £ ^0) 


(ii) 

Oj G S{ai-i,Xi) for alH = 1,.. 

.,n, and 

;iii) 

On G A'. 



The mapping 5 extends to a mapping 

S: pAx X* ^ pA 


as follows: 

1° 6{H, e) = H for all H C A, and 

2° 5{H,wx) = (J((5(a, x) | a G 5{H,w)) for all H C A,w ^ X* and x G X. 

Obviously, 5{H,w) is the set of states A may reach under the input word w from at 
least one state in H. The language recognized by A can now be defined formally as 

L(A) = {u; G X* I 6{Ao,w) nA'^ 0}. 

Every X-recognizer may be interpreted as a nondeterministic X-recognizer A, where 
Aq and the sets 6{a,x) all are singletons. On the other hand, every nondeterministic 
X-recognizer A may be turned into the equivalent X-recognizer 

B = {pA,Xj,Ao,A"), 

where A” = {H G pA | HCiA' ^ 0}; this is the well-known “subset construction”. Hence, 
a language can be recognized by a nondeterministic recognizer iff it is recognizable in 
our original sense of the word. 

Now we recall some algebraic characterizations of Rec. 

An equivalence relation q on a semigroup <S is a right congruence, if agb implies acgbc 
for all a,b,c G S. Every X-recognizer A = {A,X,S,ao,A') defines a right congruence 
gA of the free monoid X* as follows: 

u = v{gA) iff d{ao,u) = S{ao,v) {u,v G X*). 

The index of gA is at most |A| and 

L(A) = I u G X*,(5(ao,n) G A'). 

This shows that every recognizable X-language is saturated by a right congruence of X* 
of finite index. 

Suppose now that the X-language L is saturated by a right congruence g of X* of 
finite index. The X-recognizer 


A = {X*/g,X,5,eg,L/g), 
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where 6 is defined by the condition 

S{uQ,x) = {ux)q {u G X*,x € X), 


is then well-defined and 


6(eg, u) = UQ 


for each u G X*. This implies L{A) = L £ Rec X. Among all right congruences of 
X* saturating a given X-language there is a greatest one which is called the Nerode 
congruence of L. We denote it by ql and it can be defined by the condition that 


u = V (ql) iff (Vrc G X*) {uw G L vw G L) 


for all u,v £ X*. From these observations it is easy to construct a proof for the following 
theorem. 


Theorem 1.5.6 (A. Nerode 1957). For any X-language L the following three conditions 
are equivalent: 

(1) L £ Rec X. 

(2) L is saturated by a right congruence of X* of finite index. 

(3) The Nerode congruence ql is of finite index. □ 

There is a similar characterization which uses congruences of X*. Every X-recognizer 
A defines a congruence 6a of X* of finite index which saturates L{A): 

u = v {9 a) iff (Va G A) S{a,u) = 5{a,v). 

If L C X* is saturated by a congruence, then a recognizer for L can be constructed 
as above in the case of right congruences. The greatest congruence 9l saturating L is 
called the syntactic congruence of L. It may be defined by the condition that 

u = V {6l) iff (Vrc, w' £ X*) {wuw' £ L wvw' £ L) 


for all u,v £ X*. 

Theorem 1.5.7 (J. R. Myhill 1957). For every X-language L the following three con¬ 
ditions are equivalent: 

(1) L £ Rec X. 

(2) L is saturated by a congruence of X* of finite index. 

(3) The syntactic congruence 9 l is of finite index. □ 
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Let 6 he a congruence of X* saturating an X-language L. Then L = {L6^)6^ where 

9^:X* -^X*/e 

is the canonical homomorphism, and X* j6 is finite iff 9 is of finite index. This applies, 
in particular, to the syntactic congruence 9l- The monoid X*I9l is called the syntactic 
monoid of L. On the other hand, if we have a finite monoid M, a homomorphism 

g>: X* ^ M 

and a subset H C M for which L = then is a congruence of X* of finite 

index saturating L. It is now clear that Myhill’s theorem can be reformulated as follows. 

Theorem 1.5.8 For any X-language L the following three conditions are equivalent: 

(1) L G Rec X. 

(2) There exist a finite monoid M, a homomorphism (p: X* M and a subset H F M 
such that L = 

(3) The syntactic monoid of L is finite. □ 

An A-language L is called local, if there exist sets H, K C. X and I C such that 

L-{e} = {HX* n X*K) - X*IX*. 

The membership of a nonempty word w in such an L can be tested by checking that 
the first letter of w is in H, the last letter of w is in K, and that no two consecutive 
letters of w form a pair belonging to I. Note that a local language may, according to 
our definition, contain the empty word. 

A homomorphism p: X* Y* is called length-preserving if \wp\ = |rc| for all w G X*. 
Obviously p is length-preserving iff Xp C Y. 

In terms of these concepts one more characterization of Rec can be given. 

Theorem 1.5.9 An X-language L is recognizable iff L = Up for some alphabet Y, local 
Y-language U and length-preserving morphism p: Y* ^ X*. □ 

An X-recognizer A is said to be minimal, if no X-recognizer with fewer states recog¬ 
nizes L{A). It is obvious that every regular language has a minimal recognizer. To say 
more than that, we need a few concepts. 

Let A = {A, X,5,ao, A') be an X-recognizer. It is said to be connected, if there exists 
for every a € A a word w G X* such that a = 6{ao,w). Two states a and 6 of A are said 
to be equivalent, and we write a ~ 6, if 

(Vu; G X*) (<5(a,u;) G A' ^ 5{h,w) G A'). 

The recognizer A is reduced, if a ~ 6 implies a = b. 

A relation 9 G E{A) is a congruence of A, if 
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(1) aOb implies S{a, x)95{b, x) for all a,b £ A and x £ X, and 

(2) 6 saturates A'. 

Let C'(A) be the set of all congruences of A. It is not hard to prove that ~ is a congruence 
of A. In fact, it is the greatest congruence of A. 

If 0 € C{A), then one can define a quotient recognizer 

A/e = {A/e,X,5' ,aoe,A'/O) 


by putting 

5'{aO, x) = 5{a, x)9 for all a G A and x £ X. 

The congruence property (1) guarantees that 5' is well-defined. An easy induction on 
I re I shows that 

6'{a9,w) = 6{a,w)9 for all a G A and w £ X*. 

This implies L{A/9) = L{A). In particular, L(A/~) = L{A). It is now obvious that a 
minimal recognizer should be reduced and, of course, connected. 

Let A = (A, X, 5, ao, A') and B = (S, A, ry, bo,B') be two A-recognizers. A homomor¬ 
phism ip: A —> B is a mapping p: A ^ B such that 

(1) 5{a,x)p = r]{ap,x) for all a G A and x G A, 

(2) oqp = bo, and 

(3) B'p-^ = A'. 

Epimorphisms and isomorphisms of A-recognizers are, respectively, surjective and bi- 
jective homomorphisms. 

Homomorphisms, congruences and quotients of A-recognizers are related to each other 
the same way as the corresponding concepts in algebra. Hence, for any 9 £ C'(A), the 
natural mapping 9^ is an epimorphism A —)■ A/0. If (/?; A —)■ B is an epimorphism, then 
pp~^ is a congruence of A and A/pp~^ is isomorphic to B. Moreover, 

5{a, w)p = r]{ap, w) for all a G A, re G A*. 

This implies L(A) = L(B). 

The A-recognizer B is a subrecognizer of A if B C A, bo = oq, B' = A' n B and 
7] = S\B X X. The subset B determines such a subrecognizer completely. The connected 
part 

Ac = {S(ao,w) I w £ A*} 

of an A-recognizer is the state set of a subrecognizer 

Ac = (Ac, X, 6c, ao, A^ n Ac) 


where 6c = 6\Ac x A. 

The following theorem summarizes the main facts concerning minimal and reduced 
recognizers. 
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Theorem 1.5.10 (a) The minimal recognizer of a regular language is unique up to 

isomorphism, i.e., if two recognizers are minimal and equivalent to each other, then 
they are isomorphic. 

(b) A recognizer is minimal iff it is connected and reduced. 

(c) For any recognizer A, the quotient A/~ is reduced and its connected part (A/~)c 
is minimal. The recognizer Ac/~ is isomorphic to (A/~)c. 

(d) If A is minimal, B is connected and L{A) = L(B), then there exists a unique 

epimorphism y?: B —)■ A. □ 

Theorem 11.5.101 implies that one can find a minimal recognizer for a regular language 
L by starting with any recognizer A of L; first one finds the connected part Ac and 
then one has to determine the equivalent pairs of states in Ac. For both tasks there are 
simple algorithms. The order may also be reversed; first form A/ ~ and then find the 
connected part of this reduced recognizer. 

The decidability of the emptiness, finiteness and equality questions for regular lan¬ 
guages follows from the following simple observation. 

Lemma 1.5.11 Let A be an X-recognizer with n states. 

(a) If L{A) contains a word w of length > n, then one may write w = uvz so that 
0 < |u| < n and uv^z € L{A) for all k > 0. 

(b) L{A) is nonempty iff it contains a word of length < n. 

(c) L{A) is infinite iff it contains a word w such that n < \w\ < 2n. □ 

Statement (a) is often referred to as the “pumping lemma” for finite recognizers. 

To test whether L{A) is nonempty it suffices to try all input words of length < |A|. 
Similarly, the finiteness of L{A) can be checked by applying all input words w such 
that |A| < |r(;| < 2\A\. From any two A-recognizers A and B one can construct a 
recognizer for (L(A) — L(B)) U (L(B) — L(A)). But this language is empty exactly in 
case L[A) = L(B). Hence, the equivalence of A and B can also be decided. 

1.6 GRAMMARS AND CONTEXT-FREE LANGUAGES 

We shall now consider the most important tools of formal language theory, Chomsky’s 
grammars. A grammar is a device to define a language by showing how to generate the 
strings of the language. The concept is very flexible, and by imposing various restrictions 
on grammars several interesting families of languages can be obtained. A good example 
is provided by the celebrated Chomsky hierarchy consisting of four families of languages. 
At the bottom of the hierarchy we find, once more, the recognizable languages. However, 
most of this section will be devoted to context-free languages. These form the second 
step in the hierarchy. 


35 




1 PRELIMINARIES 


Definition 1.6.1 A grammar is a 4-tuple {N, X, P,ao), where 

(1) is a finite nonempty set of nonterminal symbols, 

(2) X is the terminal alphabet, 

(3) P is the finite set of productions, and 

(4) oq G is the initial symbol. 

It is required that N r\ X = 0. Every production is of the form /3 —)• 7 , where / 3,7 € 
[N U X)* and j3 contains at least one nonterminal symbol. 

Let G = {N, X, P,ao) be a grammar. For u,v £ {N U X)* we write u =^g v (or 
just u ^ V, when G is understood) if there exist u',u" G {N U X)* and a production 
/? —)■ 7 G P so that u = u'fdu" and v = v!^v!' . If u =^g v, then u is said to generate v 
directly in G. If there exists a derivation 


Uq =^G Ul =^G U2 =^G ■ ■ ■ =^G Un (n > 0) 

such that uq = u and Un = v, then we write u v (or just u v). The language 
generated by G is the A-language 

L{G) = {re G A* I ao ^*g w]. 

Two grammars are equivalent, if they generate the same language. 

The grammars of Definition 11.6.11 are very general and every recursively enumerable 
language can be generated by such a grammar. 

Definition 1.6.2 A grammar (N,X,P,ao) is called right linear, if each production is 
of the form 

a —)■ xb, a —)■ X or a —>■ e, 

where a,b £ N and x £ X. A language is right linear, or of type 3 (in the Chomsky 
hierarchy), if it can be generated by a right linear grammar. 

A right linear grammar G = {N,X,P,aQ) can be converted into a nondeterministic 
A-recognizer 

A = (AU{c},A,5,{ao},A') (c 0 A) 
which recognizes L{G) as follows. For any a,b £ N and x £ X, put 

(i) b £ S(a, x) iff a ^ xb £ P, 

(ii) c G 6{a, x) iff a ^ x £ P, and 

(iii) d{c,x) = 0. 
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Finally, let A' = {c} U{aGA^|a^eG P}. Conversely, every X-recognizer A = 
(A, X, 5, ao. A') can be replaced by the right linear grammar G = (A, X, P, oq), where 

P = {a —)■ I 5{a, x) = 6} U {a —e | a G A’}. 

These observations lead to one more characterization of Rec; 

Theorem 1.6.3 The type 3 languages are exactly the regular languages. □ 

Now we proceed to the main topic of this section. 

Definition 1.6.4 A grammar (N,X,P,ao) is context-free (CF, for short) if each pro¬ 
duction is of the form 

a —7> 7 

where a G N and 7 G {N U A)*. A language is context-free (CF) if it is generated by 
a CF grammar. The family of all CF languages is denoted by CF and the set of CF 
A-languages by CF(A). 

The CF languages are the type 2 languages in Chomsky’s hierarchy. Every right linear 
grammar is CF. Hence Rec C CF. If |A| = 1, then Rec A = CF(A), but in all other 
cases the inclusion is proper. 

Example 1.6.5 Suppose A contains two distinct letters x and y. Every derivation in 
the CE grammar 

G = ({a}. A, {a —)■ xay, a —)■ xy}, a) 

is of the form 


a xay xxayy ^ ... ^ x^ ^ ^ {n > 1 ). 

Hence, L[G) is the nonregular language {x'^y^ | u > 1}. □ 

The main fact to connect CE languages with tree automata is that context-free deriva¬ 
tions can be represented by derivation trees. A derivation tree is a description of the 
syntax of a word of the CE language. (Here it would be more natural to speak about 
“sentences” of a language.) Derivation trees have proved very useful tools in the theory 
of CE languages. Later we shall define “trees” in a way suitable for our purposes, but 
here there is no need to define the concept too formally. 

Let G = (A, A, P, ao) be a CE grammar. The derivation tree representing a derivation 
of a word u G (A U N)* from a symbol a G (A U A) in G is defined by induction on the 
number k of steps in the derivation: 

1° If A: = 0, then u = a and the derivation tree consists of a single node labelled by a. 
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2° Consider a derivation 


a => Ml => tt2 ^ Uk — l u (*) 

where k > 1. Suppose mi = di... dm, where m > 0 and di,... ,dm G N U X. 
At this point the context-freeness of G becomes essential. Every application of a 
production in (*) rewrites exactly one di or a nonterminal derived from exactly one 
di- This means that (*) may be decomposed into a number of “subderivations” 

di ^ ^ Vi (i = 1,... ,m) 

each of which yields a segment Vi of u and u = viV2 ■ ■ ■ Vm- If the derivation trees 
of the subderivations are ti,... ,tm, respectively, then the derivation tree of (*) is 
that shown in Fig. 11.21 

The possibility m = 0 was not excluded. Then k = 1, u = e and the derivation 
tree reduces to a single node labelled by a. 

The word xxxyyy has the derivation 

a => xay xxayy xxxyyy 

in the grammar of Example [L63J The corresponding derivation tree is shown in Fig. 11.31 
Consider any derivation 

Uq ^ ^ w 

of a terminal word w € L{G) from the initial symbol. The corresponding derivation 



Figure 1.2 



Figure 1.3 


tree is also called a derivation tree of w, and w can be read from the “leaves” of the tree. 

The grammar G of Example 11.6.51 has the rather special property that every word in 
L{G) has just one derivation in G. 
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Example 1.6.6 Consider the CF grammar 

G = ({ao, a, b}, {x, y}, P, oq) 
where P consists of the productions 


oo ab, a —>■ xay, a xy, b —>■ ybx and b yx. 


Obviously, L{G) 
derivations 


{xmym+Uxn ^ iji,n > 1}. The word xyyx G L{G) has the two 
ao => a6 xyb xyyx 


and 


ao => a6 => ayx => xyyx 


both of which are represented by the derivation tree shown in Fig. 11.41 In general, the 
word has different derivations all of which are represented by the same 

derivation tree. □ 



Figure 1.4 

In Example 11.6.61 the different derivations of the same word do not represent different 
syntactic descriptions of the word. In fact, they can all be obtained from each other by 
changing the order in which the individual steps are carried out. If we agree on some 
fixed order in which the sub derivations are to be carried out, then there would be just 
one derivation for each derivation tree of a word in the language. 

Definition 1.6.7 A derivation 

Uq ^ Ui U2 ^ ^ Uk 


in a CF grammar G = {N,X,P,ao) is called a leftmost derivation, if we can write, for 
every i = Q,... ,k — 1, 

Ui = Wiau[ and Uj+i = Wi'yu[ 

so that Wi G X*, a & N and a ^ P. The grammar G is ambiguous if some word w in 
L{G) has two different leftmost derivations from ao- Otherwise G is unambiguous. A CF 
language generated by at least one unambiguous CF grammar is said to be unambiguous. 
If all CF grammars generating a given CF language are ambiguous, then the language 
is said to be inherently ambiguous. 
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A CF grammar G is unambiguous if every word w € L{G) has exactly one derivation 
tree. It is ambiguous, if at least one word w € L[G) has more than one derivation tree. 
The grammars of Examples II. 6. 51 and II. 6. 6) are unambiguous. Every regular language is 
unambiguous. Of course, a language generated by an ambiguous CE grammar may be 
unambiguous. The language 

\ i = j ox j = k (i, j, k > 1)} 

is a well-known example of an inherently ambiguous language. 

There are many simplifying additional conditions that a CE grammar may always be 
assumed to satisfy. Some of these are listed below. 

Definition 1.6.8 Let G = (iV, X, P, oq) be a CE grammar. 

(a) G is reduced if either P = 0 and N = {ao}, or then for every a € N, 

oo =>* uav =>* w 

for some u,v £ {N VJ X)* and w & X*. 

(b) G is in Chomsky normal form if each production is of the form 

(i) a —7> 6c {a & N, b,c N — oq), 

(ii) a —>■ X {a G N, X € X), or 
(hi) ao —> e. 

(c) G is in Greibach normal form if each production is of the form 

(i) a —^ xai... Om {m > 0, a G N, ai,... ,am G N — oq, x G X), ox 

(ii) ao e. 

If m < A: for all productions of type (i), then G is said to be in Greibach k-form {k > 0). 

Proofs for the following facts can be found in the references given at the end of the 

section. 

Theorem 1.6.9 (a) Every CF grammar {N,X,P,ao) can he converted into an equiv¬ 

alent reduced CF grammar {N',X,P', 00 ), where N' C N and P' C P. 

(b) Every CF grammar can he converted into an equivalent CF grammar in any one 
of the following normal forms: Chomsky normal form, Greibach normal form, and 
Greibach 2-form. In all cases the grammar can he made reduced. □ 

We recall now some of the closure properties of the family CE. 

Theorem 1.6.10 If the languages U andV are CF, then so are U \JV, UV andU*. □ 
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The languages U = {x^^y'^z^ | m, n > 1} and V = | m, n > 1} are CF, but 

UnV = {x'^y'^z^ I n > 1} is not. This observation implies also that the difference U — V 
of two CF languages U and V may be noncontext-free. However, the following theorem 
holds. 

Theorem 1.6.11 If U is a CF language and V is a regular language, then U CV and 
U — V are CF languages. □ 

The following theorem implies, as a special case, that CF is closed under morphisms. 

Lemma 1.6.12 Let ip: pX* — > pT* be a substitution mapping such that xip G CF(y) 
for all xeX. IfU e CF{X), then Uip € CF(y). □ 

The following useful lemma is obtained most naturally by considering derivation trees. 

Lemma 1.6.13 (Bar-Hillel’s pumping lemma). For each CF grammar G one can find 
two natural numbers p and q such that the following holds for every word w G L{G): if 
|r(;| > p, then we may write w = uiViw'v 2 U 2 so that 

(i) \viw'v 2 \ < q, 

(ii) viV 2 7 ^ e, and 

(hi) uiv\w'v 2 U 2 G L{G) for every i > 0. □ 

Next we recall some decidability properties of CF languages. A CF language is always 
assumed to be given by a CF grammar generating it. 

Theorem 1.6.14 There are algorithms for deciding the following questions: 

(1) Is a given word in a given CF language? 

(2) Is a given CF language empty? 

(3) Is a given CF language finite? □ 

The decidability of the finiteness problem follows from Bar-Hillel’s lemma. The other 
two statements can be justified quite directly. 

Theorem 1.6.15 The following questions are undecidable: 

(a) Are two given CF languages equal? 

(b) Is the intersection of two given CF languages empty? \ finite? \ regular? \ context- 
free? 

(c) Is the complement X* — U of a CF X-language U empty? \ finite? \ regular? \ 
context-free? 
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(d) Is a given CF grammar ambiguous? 

(e) Is a given CF language inherently ambiguous? □ 

In the previous section we noted that every regular language has a minimal recognizer. 
One might want to find a CF grammar equivalent to a given one with the smallest 
possible number of nonterminals {nonterminal minimization problem) or with a mini¬ 
mum number of productions {production minimization problem). However, the following 
theorem holds. 

Theorem 1.6.16 Both the nonterminal minimization problem and the production min¬ 
imization problem are unsolvable. □ 

Let n be a fixed natural number. The sum of two n-tuples of nonnegative integers 

a = (ai, ...,an) and b = {bi, ...,bn) 

is formed componentwise: 

a -|- b = (ai bi,..., On + bn). 

Similarly, we put 

ka. = {kai ,..., kon) 

for all k G Nq and a G Nq. 

A subset K of Nq is called linear, if there exist an m > 0 and n-tuples ai,... ,am, 
b G Ng such that 

K = \k\a\ 

km^m + b I /ci,..., 

kfn GNo}. 

A subset of Nq is semilinear if it is the union of finitely many linear sets. 

Let X be an alphabet with n letters (n > 1). It is convenient to think that the letters 
of X are listed in some fixed order, xi,..., x^. The Parikh vector of a word w G X* is 
the n-tuple 

Par(t(;) = (oi,... ,0^) 

where ai is the number of occurrences of x^ in tc {i = 1,... ,n). The resulting Parikh 
mapping 

Par: A* ^ Ng 

satisfies the conditions 

(i) Par(e) = (0,... ,0) 

and 

(ii) Par(ux) = Par(u) -f Par(x) {u,v G X*). 

The mapping Par is extended to A-languages in the natural way: 

Par(L) = {Par(u;) \ w £ L} 

for all LCX*. 

Theorem 1.6.17 For every CF language L, the Parikh set Par(L) is semilinear. □ 
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1.7 SEQUENTIAL MACHINES 

Automata that produce outputs in response to inputs are generally called sequential 
machines. The basic example of these is provided by the Mealy-machine which arose 
as an abstract model of digital circuits with memory. A Mealy-machine is a system 
A = {X, A, Y, ao, 5, A), where 

(1) A is the input alphabet, 

(2) A is a finite, nonempty set of states, 

(3) Y is the output alphabet, 

(4) ao G A is the initial state, 

(5) (5: A X A —)■ A is the next-state function, and 

(6) A: A X A —> y is the output function. 

In many applications there is no fixed initial state, and oq is then omitted from the 
definition. The operation of A can be described as follows. If A is in state a (gA) and 
receives an input x (G A), then it enters state 5(a,x) and emits the letter X{a,x). In 
order to describe the behaviour of A under an arbitrary input word w G A* we extend 
S and A to mappings 

<5;AxA*^A, A:AxA*^y* 

as follows: 

1° 6{a, e) = a and A(a, e) = e for every a G A. 

2° 6{a,wx) = 5{6{a,w),x) and X{a,wx) = X{a,w)X{S{a,w),x) for all a G A, re G A*, 
X G A. 

If A receives in state a the input word w, it emits the word X{a,w) (gA*) and ends 
up in state S{a,w). The translation induced by A is defined as the relation 

ta = {{w, A(ao, w))\w€ A*} (C A* x A*). 

Two Mealy-machines are said to be equivalent if they define the same translation. 

In the case of a Mealy-machine A every input word w has exactly one translation 
X{ao,w) and this has the same length as w. Mealy-machines enjoy a number of desirable 
properties and they have a well-developed theory. For example, the following facts are 
known: 

(a) The translations induced by Mealy-machines have a very simple characterization. 

(b) The equivalence problem of Mealy-machines is decidable. 

(c) For any Mealy-machine one can find an equivalent minimal Mealy-machine and this 
is unique up to isomorphism. 
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(d) Let A be the Mealy-machine defined above. If L G Rec X, then Lta G Rec Y. If 
L G Rec Y, then Lt^^ G Rec X. 

There are several ways to generalize Mealy-machines. First of all, both the next-state 
and the output behaviour may be nondeterministic. Another generalization allows the 
sequential machine to emit a word in response to each input letter. Moreover, one may 
add a set of final states. Then a translation of a word is accepted just in case it leaves 
the machine in a final state. We shall now define a generalized sequential machine which 
includes all these features. It is now convenient to use a set of productions which will 
account both for the next-state behaviour and for the outputs. We arrive at the following 
concept. 

Definition 1.7.1 A (nondeterministic) generalized sequential machine (gsm) is a sys¬ 
tem A = (X, A, Y, ao,P, A') where 

(1) A is the input alphabet, 

(2) A is a finite, nonempty set of states, 

(3) Y is the output alphabet, 

(4) ao (sA) is the initial state, 

(5) P is a set of productions of the form ax —>■ wb with a,b G A, x € X and w G Y*, 
and 

(6) A' C A is the set of final states. 

It is assumed that An (A U A) = 0. The gsm A is said to be deterministic if there exists 
for each pair (a, x) G A x A exactly one production of the form ax —)■ wb. 

Let A be the above gsm. A production ax wb is interpreted as follows. If A 
is in state a and receives the input x, A may enter state b and simultaneously emit 
the word w. We shall now define the translation performed by A. For any two words 
p, q G (A U A U Y)*, we write p if there exist a production ax wb m P and 

words p' and p" such that p = p'axp" and q = p'wbp". The reflexive, transitive closure 
of =i>A is denoted by =^a- Thus p q {p, q G [A U X U A)*) holds iff there exists a 
derivation of the form 

P = Po Pi • • • =^A Pk = q {k> 0). 

Now, the translation induced by A is defined as the relation 

TA = I u G X*,v G Y*,aou =^a fo^' some b G A'}. 

If (n, v) G Tx, then u is a translation of u. If A is deterministic, then each A-word w has 
at most one translation. Two gsm’s are equivalent if they induce the same translation. 

The tree transducers, which form the subject matter of Chapter [H may be viewed as 
further generalizations of gsm’s in which trees replace words as inputs and as outputs. 
The following two theorems may be compared with some of the results to be presented 
in Chapter 01 
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Theorem 1.7.2 Let A = {X, A,Y,ao, P, A') be a gsm. If L ^ Rec X, then Lta £ 
Rec Y. If L ^ Rec Y, then Lt^ G Rec X. □ 

Theorem 1.7.3 The equivalence problem of deterministic gsm’s is decidable, but the 
equivalence problem of nondeterministic gsm ’s is undecidable. □ 

The next-state behaviour of a gsm is identical to that of a nondeterministic Rabin-Scott 
recognizer. Thus the following fact, which will be needed in Chapter 01 is obvious. 

Lemma 1.7.4 Let A be a gsm as defined above. For any two states a,b G A, the 
language 

L{a, b) = {u £ X* \ au =^*a bv for some v G Y*} 
is regular. □ 
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2 TREE RECOGNIZERS AND 
RECOGNIZABLE FORESTS 


This chapter is devoted to finite-state tree recognizers and the family of forests recog¬ 
nizable by them. Here trees are defined as terms over a finite operator domain, and a 
forest (or tree language) is just a set of trees. As in the case of formal languages, there 
are two particularly natural ways to effectively define a forest; a forest can be recognized 
by an automaton, or it can be generated by a grammar. In Section 12.21 we introduce 
the tree recognizers which correspond to Rabin-Scott recognizers. It does not make any 
difference whether Rabin-Scott recognizers are defined to read words from left to right or 
from right to left, but here we should consider both recognizers that read trees from the 
leaves down towards the root (frontier-to-root recognizers) and recognizers which work 
in the opposite direction (root-to-frontier tree recognizers). In both cases the recognizer 
may be either deterministic or nondeterministic. This gives us four types of finite-state 
tree recognizers. Three of these define the same family of forests, the family Rec of rec¬ 
ognizable forests. Deterministic root-to-frontier recognizers are essentially weaker and 
they define a proper subfamily of Rec. In Section (2.31 we define regular tree grammars. 
After having shown that these can be reduced to a very simple normal form, we prove 
that regular tree grammars generate exactly the recognizable forests. Often it will be 
convenient to use regular tree grammars in the study of recognizable forests. In Section 
El several operations on forests are considered. Many of these arise as a generalization 
of some basic language operation. Usually Rec can be shown to be closed under such 
operations. However, one should note that there are often many ways to generalize from 
languages to forests, and a right choice among the alternatives is essential if one wants to 
generalize the corresponding results, too. For example, there is a natural generalization 
of the product of languages with respect to which Rec is not even closed. A related point 
is demonstrated by the case of tree homomorphisms. Here the greater generality of trees 
compared with words admits of some entirely new phenomena, such as the copying of 
subtrees. 

In Section 12.51 regular expressions to denote forests are defined, and the appropriate 
generalized Kleene theorem can then be proved. Section f2.6l contains the minimization 
theory of deterministic frontier-to-root tree recognizers. In Sections 12.71 tothe family 
Rec is characterized in some further ways. Recognizable forests are described by means 
of congruences of the term algebra, as solutions of fixed-point equations, and in terms of 
local forests. Moreover, a Medvedev-type characterization in terms of certain elementary 
forests and elementary operations is given. In Section [2. 101 we show that the emptiness, 
the finiteness, and the equivalence problems of recognizable forests are decidable. Section 
I2.11l is devoted to deterministic root-to-frontier recognizers. The forests recognizable by 
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them are characterized by means of a certain closure property. Furthermore, we show 
that these recognizers have canonical minimal forms. 

In this chapter we try to cover the central parts of what could be called “the gener¬ 
alized theory of finite automata”, but many topics had to be excluded. Some of these 
are mentioned in the Notes and references. There we shall also indicate a few other 
developments not directly related to this chapter as well as some applications of the 
theory of tree automata. 

2.1 TREES AND FORESTS 

The “trees” which appear in tree automata theory may be visualized as tree-like directed 
labelled graphs. Such a tree has exactly one node, the root, to which no edge enters. 
From the root there is exactly one path to every node. Moreover, it is essential that 
the edges leaving a given node have a specified left-to-right order. This concept has 
been formalized in several ways, but the variations in the definition are of little or no 
consequence. We shall choose a definition that suits well an algebraic treatment of the 
theory. 

For the labelling of the nodes of a tree we need two alphabets of different kind, a ranked 
alphabet and a frontier alphabet. As a rule, these two are assumed to be disjoint. A 
ranked alphabet is a finite nonempty operator domain (cf. Sect. II.2p . From now on S 
always represents a ranked alphabet. Other symbols to be used for ranked alphabets 
include D and F. The inclusion SCO means that S^ C 0^ for all m > 0. If S^nO^ = 0 
whenever m ^ n, then S U O may be defined: 

(S U = Ejn U Dm for all m > 0. 

A frontier alphabet is simply an alphabet in the usual sense, but sometimes we should 
let it be empty. In fact, in most cases there is no need to exclude this possibility. Our 
usual symbols for frontier alphabets are A, Y and Z. 

For any S and A, a TX-tree is simply a SA-term. Thus the set of SA-trees is Fs(A). 
In many cases S or A, or both, are either understood or unspecified. In such cases we 
often speak about Y-trees, X-trees or just trees. A similar situation will arise whenever a 
concept involves a ranked alphabet and a frontier alphabet. We shall not lengthen such 
definitions by listing the modified names, but they will be used without explanation 
whenever convenient. 

The letters p, q, r, s and t are reserved for trees. 

Although trees are defined as strings, they can be visualized as, and are in fact intended 
as representations of, such tree structures as described above. 

Example 2.1.1 Let S = S 0 US 1 US 2 be a ranked alphabet, where Sq = { 7 }, Si = {w} 
and S 2 = {cr}. As the frontier alphabet we take A = {x,y}. Then t = u {a{y,a{'y,x))) 
is the SA-tree shown in Fig. 12.11 □ 

Any other way of writing SA-terms would suit our purpose equally well. For example, 
in Polish notation the tree t of Example 12 . 1.11 would be written as ujaya’jx, but it would 
still be treated in tree automaton theory as the “tree” shown in Fig. 12.11 
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2.1 Trees and forests 



Figure 2.1. 


Term induction will now be called tree induction. Below some important concepts are 
defined by tree induction. 

Definition 2.1.2 The height hg(t), the root root(t) and the set of subtrees sub(t) of a 
SX-tree t are defined as follows: 

1° If t E X U So, then hg(t) = 0, root(t) = t and sub(t) = {t}. 

2° If t = a{ti, ..., tm) (m > 0), then 

hg(t) = max(hg(tj) \ i = 1 ,... ,m) + 1 , 
root(t) = a, and 

sub(t) = U (sub(ti) 1 1 < i < m) U t. 

For the tree of ExamDle l2.1.1l we get hg(t) = 3, root(t) = co and sub(t) = {t, a {y, a(j, x}) , 
y,o-( 7 ,x),j,x}. 

Subtrees of height 0 are referred to as the leaves of the tree. A leaf is labelled by a 
letter from the frontier alphabet or by a nullary operator. The length |t| of a tree t is 
simply its length as a word. The leaves of tree t of our example are y, 7 and x. Its length 
is 15 (when parentheses and commas are counted, too). Of course, one can define and 
prove things about trees by induction on the length; but in practice this mostly reduces 
to tree induction. Induction on the height hg(t) is equivalent to tree induction. 

We shall use the term frontier in a rather informal way to designate the part of a tree 
consisting of the leaves. The frontier of the tree of Example 12 .1.11 consists of the nodes 
labelled hy y, 7 and x. The same letter or nullary operator could appear several times as 
a leaf in the frontier. The visual picture of a tree also suggests the notions of a branch 
and that of a path. In our t there are two main branches leaving the lower a. They 
correspond to the subtrees y and £ 7 ( 7 , x). There are three paths from the root to the 
frontier. They spell out the words ovay, ujaa 7 and uaax, respectively. These terms are 
used in a descriptive manner to aid the intuition and no precise definitions are needed. 

Note. In the literature the root is often called the “top” of the tree, while its frontier 
is referred to as the “bottom”. Then “top-down” indicates the direction from the root 
towards the frontier, and “bottom-up” means the opposite direction. This terminology 
is connected with the common practice of drawing trees upside-down. 
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2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


The same tree may occur several times as a subtree of a given tree and one should 
distinguish between a subtree and an occurrence of a subtree. It is possible to assign 
coordinates to the nodes of a tree and then indicate a certain occurrence of a subtree by 
the coordinates of its root. However, the following simple device to specify an occurrence 
of a subtree will suffice. For any occurrence of a subtree s of a tree t, there is a unique 
way to write t = usv. Here u and v are just words and the occurrence of s is uniquely 
determined by u. 

We shall now consider some ways to construct new trees from given ones. The very 
dehnition of Fs(X) suggests such a construction. If m > 0, u G and ti,... ,trn € 
then a{ti,... ,tm) is a new SX-tree which could be called the a-catenation of 
ti, ... ,trn- It is obtained by connecting the roots of the trees ti,. .. Rm to a new root 
labelled by a. The construction is illustrated by Fig. 12.21 



Figure 2.2. 


Figure 2.3. 


Note that the cr-catenation is the u-operation of the SX-term algebra 




Let t be a SX-tree and suppose we are given a tree Sx for every x £ X. The tree 
denoted by 

t{x ^ Sx \ X £ X), or just t{x £- Sx), 

is obtained by substituting in t, simultaneously for every x € X, for each occurrence 
of X. The formal definition by tree induction reads as follows: 

1° If t = z £ X, then t(x <— Sx) = Sz. 

2° If t = a £ So, then t{x £- Sx) = cr. 

3° If t = a{ti, ... ,trn), then 

t{x ^ Sx) = a (ti{x ■(- Sa;), . . .,tm{x ^ Sx)). 


If the trees Sx are SX-trees, then t{x £- Sx) is also a SX-tree. However, the construc¬ 
tion works also in the more general case where the trees Sx are HT-trees for some H and 
Y such that S^ n = 0 whenever m ^ n. Then t{x £- Sx) £ F-£\jn{Y)- 
Suppose X = {xi,..., Xn}. One may then write t{x ■£- Sx) in the more explicit form 


t{xi 


^Xl ? • 
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2.1 Trees and forests 


If the order xi,... ,Xn is understood, we may write simply t , Sx^). 

A letter x may be left unrewritten by choosing Sx = x. The notation t(xi ■<— 
si,...,Xn ■<— Sn) is used more generally to indicate a substitution where the letters 
Xi are rewritten as the corresponding Si {i = 1, ... ,n), but all the other letters of X are 
left unchanged in the tree t. 

Example 2.1.3 Suppose 7 G Sq, cj G S 3 and x,y, z ^ X. If t = cr (y, £ 1 ( 7 , x, y), z), then 

t {y X, z -fr- a(x, X, z)) = a (x, a('y, x, x),a{x, x, z)) . 

The tree is shown in Fig. 12.31 □ 

Often a certain occurrence of a subtree s of a tree t should be replaced by a tree r. If 
the presentation t = usv indicates the particular occurrence of s, then the result is urv. 
It is easy to show that urv is also a SX-tree whenever t, r G Fj](X). The operation may 
also be described as follows. Let ^ be a new letter. There is a unique tree t' G Fs(XU^) 
with exactly one occurrence of ^ such that t = •£— s). Then urv = ■£— r). Other 

ways to operate on trees will be encountered later on. 

Trees define polynomial functions in algebras. These will be very important, and we 
shall now see how the basic tree operations are reflected in them. Let A = (A, S) 
be a S-algebra. If t G Fs(X) is obtained by cj-catenation from the trees ti,...,tm 
(m > 0, cr G S^), then 

is simply the composition of t'f,... ,tm with Now we consider the substitution 

operation. Let X = {xi,..., and t, si,..., Sn G Fs(X). The polynomial function 

t{si ,..., Sn)"^ ■ —>■ A 

is computed as follows. For any a : X ^ A, 

t{si,...,Sn)-^{a) 

where jd'.X^Ais defined so that Xijd = sA{a) for all z = 1 ,..., n. 

Finally, consider the replacing of an occurrence of a subtree s of a SX-tree t by a 
SX-tree r. Write t s) as explained above. For any a : X —A, we get then 

t'{^ t— r)'^(a) = 

where a' : X U ^ A is defined so that a'\X = a and = r"^(a). 

A SX-forest is simply a subset of Fs(X). Many authors call forests tree languages. In 
general, we use the letters R, S and T for forests. 

If S C n and X C Y, then all SX-trees are flT-trees, too. Thus every SX-forest 
may be viewed as an nX-forest. In most cases this can safely be done. For example, a 
SX-forest is recognizable (in the sense defined in the next section) as a SX-forest iff it 
is recognizable as an flX-forest. 
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2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


Of course, those forests only are of interest that can be defined in some natural way. 
This chapter is devoted to a family of such forests, the forests recognizable by finite tree 
automata. In the theory of these forests many concepts and results familiar from the 
theory of recognizable languages can be perceived. The generalization from words and 
languages to trees and forests will be considered in the next section. 

2.2 TREE RECOGNIZERS 

In this section we introduce tree recognizers, that is, tree automata which define forests. 
There are four basic types of these recognizers. A tree recognizer may be defined in 
such a way that it reads its input trees from the frontier towards the root. Then it is 
called a frontier-to-root recognizer, or an F-recognizer for short. A tree recognizer which 
reads the trees starting at the root proceeding then towards the frontier is called a root- 
to-frontier recognizer, or simply an R-recognizer. In both cases the recognizer may be 
either deterministic or nondeterministic. As a rule, all tree recognizers considered here 
are finite, i.e., they have a finite number of states. 

Our first task will be to compare the families of forests recognizable by these four 
types of tree recognizers. It turns out that we get just two families. Deterministic F- 
recognizers, nondeterministic T-recognizers and nondeterministic i?-recognizers all have 
the same recognition power. The forests recognized by them are termed recognizable. 
Deterministic /^-recognizers are considerably weaker and they yield a rather special 
subfamily of the recognizable forests. 

As stated in the previous section, S is always a ranked alphabet and X is a frontier 
alphabet. 

Definition 2.2.1 A frontier-to-root EX-recognizer or an {F)EX-recognizer, for short, 
A consists of 

(1) a finite S-algebra A = {A, S), 

(2) an initial assignment a : X ^ A and 

(3) a set A' C A of final states. 

We write A = (A, a. A') or A = (A,E, X, a, A'). The forest recognized by A is the 
SX-forest 

r(A) = {te Fs(X) I f^{a) G A'}. 

A SX-forest T is said to be recognizable, if there exists a SX-recognizer A such that 
T = r(A). The family of recognizable forests is denoted by Rec, and Rec(5], X) denotes 
the set of all recognizable SX-forests. 

The recognizers defined above are finite and deterministic although this has not been 
emphasized in the name. They are our “basic” type of tree recognizer and we shall usually 
omit the label “T” which distinguishes them from root-to-frontier tree recognizers. The 
elements of the underlying algebra A are called the states of A and A is its state set. 
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2.2 Tree recognizers 


If not otherwise specified, A will be the SA-recognizer (A, a, A'). Also B and C will 
usually be the SA-recognizers {B,j3,B') and (C, 7 ,C"), respectively. Here B = 
and C = {C, S) are S-algebras, (3 : X ^ B and 7 : A —)• C are the initial assignments, 
and B' Q B and C C C . 

In algebraic terms the operation of the SA-recognizer A can be explained as follows. 
Given an input tree t G Ts(A) the polynomial function is evaluated on the initial 
assignment a. The tree is accepted exactly in case the result is a final state. If 

a : As (A) A 

is the extension of a to a homomorphism, then 

= to. for every t G Ts(A), 

and we may write 

r(A) = {t G Ts(A) I ta G A'} = A'a-^. 

A more pictorial description of the operation of A in automata theoretic terms is also 
possible. Given an input tree t, A starts reading it from the leaves in states that depend 
on the labels of the leaves. If a certain leaf is labelled by a frontier letter x, then A is 
in state xa at that leaf. If the label is a nullary operator cr, then A starts from that 
leaf in state Now A moves down all the branches towards the root step by step as 
follows. If a given node v is labelled by the m-ary operator a {m > 0), then A enters v 
in state ..., am), where oi,..., am are the states of A at the nodes immediately 

above v, listed in order from left to right. The tree is accepted if A enters the root in a 
final state. 

Example 2.2.2 Let S = Si U S 2 , Si = {~}, S 2 = {A,V} and A = {x,y}. Define the 
operations of the S-algebra A = ({0,1}, S) by the tables below: 


a 

( 0 ) 

a 

b 

A^(a,6) 

V“^(o,6) 

0 

1 

0 

0 

0 

0 

1 

0 

0 

1 

0 

1 



1 

0 

0 

1 



1 

1 

1 

1 


Define an initial assignment so that xa = 1 and ya = 0. To complete the definition of 
our SA-recognizer A we choose {1} as the set of final states. The computation of A on 
the tree 

i = A (~ (A(y, x)), V(~ (y), x)) 

is shown in Fig. 12.41 The states of A at the nodes are shown in parentheses. The tree 
is accepted since the state at the root is 1. Let ~, A and V have their usual meanings as 
symbols for the logical connectives “not”, “and” and “or”. Then SA-trees are expres¬ 
sions of propositional logic in the two propositional variables x and y. If 0 and 1 are 
interpreted as the truth values “false” and “true”, respectively, then A computes the 
truth values of propositions, when the truth values of the variables are given. The forest 
recognized by A consists of the propositions (in variables x and y) that are true when x 
is true and y is false. □ 
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2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 



Example 2.2.3 Let S = S 2 = {+, •} and X = {rci,... ,Xn} for some n > 1. The EX- 
trees may now be interpreted as arithmetic expressions in variables xi,... ,x„. Using 
the customary infix notation one could write, for example xi + xi ■ X 2 rather than 
+(xi, -(xi, X 2 )). Let m > 0 and define the S-algebra A = ({ 0 ,1 ,..., m — 1}, S) so that 

a b = a + b (mod m) 


and 


a-'^b = a-b 


(mod m) 


for all a, 6 = 0,1,..., m — 1. If t is a SX-tree and a ; X —)■ A is any mapping, then 
is the value of the expression t (mod m) when the variables are assigned values according 
to a. Thus any SX-recognizer A = (A, a, A') based on the algebra A recognizes a set of 
arithmetic expressions which get a value (mod m) in A' when each variable Xj is given 
a certain value XiU {i = 1,... ,n). □ 


The examples suggest some useful general observations on tree recognizers. A tree 
recognizer is a device that evaluates an expression (a tree) for given values of the variables 
(given by the initial assignment) and decides then on the basis of this value whether the 
expression belongs to given set or not. Since the state set is finite such an evaluation 
is always “modulo something”. For example, we could not construct a tree recognizer 
which would find out whether the value of an arithmetic expression is a prime or not. 
Similarly, there is no tree recognizer that recognizes the set of all trees in which two given 
operators appear the same number of times. The following example discusses another 
manifestation of the same phenomenon. 


Example 2.2.4 Let S = S 2 = {u} and let X be an arbitrary nonempty frontier alpha¬ 
bet. Then the forest 

T = {ait,t) \ t€F^{X)} 

is not recognizable. For suppose T = T(A) for some SX-recognizer A. Since A is finite, 
there must exist two different SX-trees s and t such that sa = ta. But then we would 
have that 

a{s,t)a = (j'^{sa,ta) = cj'^(sd, sd) = cj(s,s)q; € A', 
which implies the contradiction a{s, t) (z T. □ 
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2.2 Tree recognizers 


Let us now look how tree recognizers arise as generalizations of the Rabin-Scott recog¬ 
nizers through a universal algebraic interpretation. First, let A = (A,I,6,ao,A') be an 
/-recognizer as defined in Sect. 11.51 (to avoid confusion we use I as the input alphabet). 
Define a ranked alphabet S such that Si = / and S^ = 0 for all m 7 ^ 1. The next-state 
mapping of A is completely determined by the S-algebra A = (A, S) which is defined 
so that 

cr'^(a) = 6{a,a) for all a G A and a G I. 

If we put X = {x}, then /-words and SA-trees can be identified as follows. The empty 
word e corresponds to the tree x, and a nonempty word ai... {k > l,ai G I) may 
be interpreted as the tree (Tfc(... iTi(x) ...) (the reverse Polish notation for trees would 
make the identification even more natural). Dehne a : A —> A so that xa = oq. Then 

5(ao, t) = t'^(a) for all t G I*{= Fy,{X)\). 

This implies that the forest recognized by the SA-recognizer (A, a, A') is, interpreted 
as an /-language, the language recognized by A. Hence a Rabin-Scott recognizer may 
be viewed as a tree recognizer over a unary ranked alphabet and a one-element frontier 
alphabet. The general SA-recognizers result when one does not require S to be unary 
and allows also an arbitrary frontier alphabet A. 

The nondeterministic frontier-to-root tree recognizers that we soon shall dehne may 
be viewed as generalized T’-tree recognizers in which nondeterminism is allowed both in 
the assignment of states to the leaves and in the next-state behaviour. First we have to 
introduce nondeterministic operations and nondeterministic algebras. 

An m-ary nondeterministic (ND) operation on a set A is a mapping from A™ to pA 
(m > 0). Thus an m-ary ND operation 

/:A™ ^ pA 

assigns to every m-tuple of elements from A a subset of A. A nullary ND operation 

/ : {0} ^ pA 

hxes a subset of A, and / may be identihed with this subset /(0). A nondeterministic 
(ND) Tj-algebra A = (A, S) consists of a nonempty set A and a family {o"^ | u G S} of 
ND operations on A such that for each <7 G S, is m-ary if a G Sm- The ND S-algebra 
is finite if A is hnite. A S-algebra may be viewed as an ND S-algebra when elements 
a G A are identihed with the corresponding singletons {a}. 

On the other hand, we associate with every ND S-algebra A = (A, S) an ordinary 
S-algebra, namely the subset algebra 

pA = (pA,S) 


where 

(Ai, . . . , Ayyj) — (cT (cii, • • • , Um) I € Ai , . . . , Qm € A^'j 
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2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


for all m > 0, fj G and Ai,..., Am C A. Now any mapping 

a : X ^ pA 

may be extended to a homomorphism 

a : —>■ p^. 

Consider a SX-tree t. The computation of the set ta may be described in automata 
theoretic terms as follows. If a leaf is labelled by a letter x, then the “automaton” A 
may start at that leaf in any one of the states in xa. If a leaf is labelled by a nullary 
operator, then is the set of the possible starting states. Let v be any node in the 
tree labelled by an m-ary symbol a (m > 0). Let a(ti,. ■ ■ Am) be the subtree of t 
which has v as its root. Then tia,... are the respective sets of possible states of 
A at the nodes immediately above v. Now A may enter v in any one of the states from 
crP-^(tiQ;,..., tmA). Clearly, ta is the set of all states in which A may be at the root of t. 


Definition 2.2.5 A nondeterministic frontier-to-root EX-recognizer, or an NDF EX- 
recognizer for short, A consists of 

(1) a finite ND S-algebra A = (A, S), 

(2) an initial assignment a : X —> pA and 

(3) a set A' C A of final states. 

We write A = (A, a. A') or A = (A, S, X, a, A'). The forest reeognized by A is the 
SX-forest 

r(A) = {te Fs(x) \ tan A' A 0}- 

The definition of T{A) means that a tree t is accepted by A iff there is a set of choices 
of initial states for the leaves and next-states for the other nodes such that A enters the 
root of t in a final state. It is rather obvious that the SX-recognizer 

pA = (pA,a,A"), 

where 

A" = {Ai G pA I Ai n AV 0}, 

recognizes the same forest as A. Indeed, for any t G Fs(X), 

t G r(pA) iff E-^{a) G A" iff ta G A" 
iff tdnAV0 iff tGT(A). 

This is the natural generalization of the usual subset construction as applied to ND 
Rabin-Scott recognizers, and pA is the “subset recognizer” corresponding to A. Since 
every SX-recognizer may be viewed as an equivalent NDF SX-recognizer we have veri¬ 
fied the following theorem. 
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2.2 Tree recognizers 


Theorem 2.2.6 The forests recognized by nondeterministic frontier-to-root recognizers 
are exactly the recognizable forests. □ 

We begin the discussion of root-to-frontier tree recognizers with the nondeterministic 
version. In a nondeterministic root-to-frontier T,-algebra (NDR T-algebra, for short) 
A = {A, S), A is a nonempty set and every a € with m > 1 is realized as a mapping 

: A ^ p{A^). 

For a G Sq, is a subset of A. We call A finite, if A is finite. 

Definition 2.2.7 A nondeterministic root-to-frontier T,X-recognizer A, or an NDR 
YjX-recognizer, consists of 

(1) a finite NDR S-algebra A = {A, S), 

(2) a set A' C A of initial states, and 

(3) a final assignment a : A —> pA. 

We write A = {A, A', a) or A = {A,T,, X, A',a). The elements of A are called states. 

In order to make the formal definition of the forest recognized by such an A easier to 
understand, we shall first describe its intended operation. At the root of a given SA-tree 
t, A may be in any initial state a G A'. Consider now any node u of t labelled by some 
a G Sm with m > 1. If a is a possible state of A at u and (oi,... ,am) G a'^{a), then 
A may assume state oi at the leftmost node immediately above v, state 02 at the node 
immediately to the right of this node etc. For every m-tuple in A has such a 

sequence of possible next-states for the nodes directly above v. Note that the possible 
states at these nodes are connected with each other: (oi,..., am), {o.[, ■ ■ ■, a'm) G o"^(a) 
does not imply, for example, (a'^, 02 ,..., Om) G a-^(a). The tree t is accepted by A if it 
is possible to choose the initial state for the root and then make the consecutive choices 
of next-state vectors in such a way that A arrives at each leaf labelled by a frontier 
letter x in a state belonging to xa, and at each leaf labelled by a 0 -ary symbol cr in a 
state belonging to o"^. It is easier to formalize this recognition process by tracing it from 
the leaves back to the root. The idea is to see which states at each node can lead to 
acceptance. For the leaves this is clear. If a leaf is labelled by x G A, then the accepting 
states for that leaf form the set xa. If a leaf is labelled by ct G Sq, then the accepting 
states are those belonging to a-^. Now one can infer the states that are accepting at the 
nodes immediately below the leaves. When these have been found, we may determine 
the states in which A should be at nodes one level deeper in a tree. Finally one finds 
out the accepting states for the root. The tree is accepted iff at least one of these is an 
initial state. 

Definition 2.2.8 Let A = (A, A', a) be an NDR SA-recognizer. A mapping 

a : Ts(A) pA 


is defined as follows: 
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2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


1° If X G X, then xa = xa. 

2° If cr € So, then aa = 

3° If t = cr{ti, ..., tm) {m > 1), then 

ta = {a & A \ a-^{a) fl {tia x ... x tmo) ^ 0}. 

The forest recognized by A is the SX-forest 

r(A) = {t G Fs(X) I td n A' ^ 01- 

Example 2.2.9 Let us consider again the arithmetic expressions, defined in Example 
12.2.31 We shall construct an NDR S{xi,X 2 }-recognizer which accepts an expression in 
variables xi and X 2 iff the value of the expression is divisible by 4 when xi = 0 or 2 (mod 
4) and X 2 = 3 (mod 4). An obvious choice for a state set is A = {0,1, 2,3}. The set of 
initial states is {0}, and the final assignment is defined by xia = {0, 2} and X 20 t = {3}. 
The next-state behaviour is determined by inferring the possible summands or factors 
from the sum or product, respectively. We get 

+^(0) = {(0,0), (1,3), (2,2), (3,1)} 

+"^(1) = {(0,1), (1,0), (2, 3), (3, 2)} 
etc., and 

(0) = {0} X A U A X {0} U {(2, 2)} 

■^(1) = {(1,1),(3,3)} 

etc.. 

Note that we would get an equivalent NDF-recognizer by “inverting” these operations 
(0 +•^0 = 0 etc.), and making {0} the set of final states and a the initial assignment. □ 

The concluding observation of Example 12.2.91 can be generalized as follows. We say 
that the NDF SX-recognizer A = (A, S,X, a,A') and the NDR SX-recognizer B = 
(R, S, X, R',/3) are associated if 

(1) A = B, A' = B' and a = (3, 

(2) (ai,...,am) G a^{a) iff a G (T'^(ai,..., Om), for all m > 1, u G and 
a \,..., a^a , tt G A, and 

(3) for every ct G Sq. 

It easy to see that d = /3 if A and B are associated. Since every NDF tree recognizer 
has an associated NDR tree recognizer, and conversely, we get 

Theorem 2.2.10 The forests recognizable by NDR tree recognizers are exactly the rec¬ 
ognizable forests. □ 
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2.3 Regular tree grammars 


A deterministic root-to-frontier TiX-recognizer, or a DR TiX-recognizer, is a NDR T,X- 
recognizer A = (A, A', a) such that A' and all of the sets o"^{a) {a E Sm, m > 1, a E A) 
and with cr E Sq contain exactly one element. Thus a DR SA-recognizer A has 
exactly one initial state and in every situation there is exactly one choice of next-state 
vector. Moreover, there is exactly one final state for each leaf labelled by a nullary 
symbol. The forest recognized by A is defined the same way as in the general case. 

That determinism is a real limitation in the case of root-to-frontier recognizers is shown 
by the following example. 

Example 2.2.11 Suppose u E S 2 and E A. If a DR SA-recognizer accepts the 
trees a{x,y) and a{y,x), then it must accept a{x,x), too. Hence, the forest T = 
{a{x,y),a{y,x)} cannot be recognized by any DR SA-recognizer. On the other hand, 
it is obvious that T E Rec(S, A). □ 

The inability of these recognizers to cope with situations such as that in Example 
12. 2. ni ls due to the fact that they have to read disjoint subtrees separately without any 
possibility to combine the information gathered from the individual subtrees. In an NDR 
tree recognizer this handicap is compensated for by their ability to make several guesses 
about the subtrees jointly before reading them separately. 

2.3 REGULAR TREE GRAMMARS 

So far, the recognizable forests have been characterized by means of three types of tree 
recognizers. Now we shall introduce a class of tree grammars that also defines the 
family of recognizable forests. These grammars are the natural counterparts to type 3 
grammars. 

Definition 2.3.1 A regular TiX-grammar G consists of 

(1) a finite nonempty set N of nonterminal symbols, 

(2) a finite set P of productions of the form a ^ r, where a & N and r E Fj:{N U A), 
and 

(3) an initial symbol oq E N. 

It is assumed that A n (S U A) = 0. We write G = (A, S, A, P, oq). 

When S and A are not specified, we speak about regular tree grammars or just gram¬ 
mars, if there is no danger of confusion. 

Let G be a regular tree grammar as in the definition above. The right-hand side of a 
production is a tree in which nonterminal symbols may appear at the leaves only. For 
p,q & Fj]{X U N), we write 


P Q (or just p ^ q) 
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if there exist a G N,r £ Fy:{X U N) and words u,v such that p = uav,q = urv and 
a ^ r G P, i.e., p =^g Q rneans that q is obtained by replacing an occurrence of a 
nonterminal symbol a by a tree r, where o —>■ r is a production of the grammar. More 
generally, we write 

P Q P Q) 

if p = q or there exists a (nontrivial) derivation 


P =^G Pi =^G ■ ■ ■ =^G Pn-I =^G Q {n>l) 


of q from p. Hence, =^* is the reflexive, transitive closure of when we view it as a 
relation in Fs {X U N). 

Definition 2.3.2 The forest generated by a regular SX-grammar G = {N, S, X, P, oq) 
is the SX-forest 

T{G) = {tG Fs(X) I ao t}. 

Two regular SX-grammars Gi and G 2 are said to be equivalent if T{Gi) = T{G 2 )- 


Example 2.3.3 Let S = Sq U S 2 , Sq = {u;}, S 2 = {cr} and X = {x}. Define the 
regular SX-grammar 

G = ({a,6},S,X,P,a), 


where 


P = {a —)■ cj(x, cr(x, 6)), a -G cr(a;, a), b -G a{x, x)}. 


The tree 


t = a (w, cr(x, cj(x, cr(x, x)))) 


is in T{G) and it has the derivation 


a cr(a;, a) cr(a;, cr(x, cr(x, b))) => t. 


If the graphical representation of trees is used, this derivation can be written as in 

Fig. ESI □ 



Figure 2.5. 
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A regular SA-grammar may be viewed as a context-free grammar with a terminal 
alphabet consisting of S, A, the parentheses and the comma. Thus, if we treat trees as 
words, then the forests generated by regular tree grammars are special CF languages. 
However, we are mainly interested in them as forests, and we shall prove that exactly 
the recognizable forests can be generated by these grammars. To facilitate the proof 
first we show that the form of the productions may be restricted considerably without 
limiting the generative power of regular tree grammars. 

To begin with, we note that productions of the form 

a —)■ 6 (a, 6 € A) 

are not needed. All such productions can be deleted if we add to P all productions a r 
(a G N,r G Fj](X U N) — N) such that a b and 6 —^ r € P for some b G N. (It is 
easy to see that a b is decidable for a,b G N.) 

Call hg(r) the height of the production a r. If the height of a production a ^ r 
is > 1, then r is of the form cj(ri,... ,rm), where m > l,a G T,m and hg(rj) < hg(r) 
for each i = 1, ... ,m. If we introduce new nonterminal symbols ai,...,am and the 
productions 

a ^ a{ai,... ,am) (*) 

and 

ai^Vi (i = I,...,m), (**) 

then the production a ^ r may be deleted without changing the forest generated. 
Indeed, any application of a —)■ r can be replaced by an application of (*) followed by 
applications of the productions (**). On the other hand, none of the productions (**) 
can be used unless (*) has first been used, and when (*) has been applied it must be 
followed by applications of all productions (**) as there is no other way to rewrite the new 
nonterminals a*. The total effect of these steps is the same as that of a single application 
of a —)• r. Thus every production of height > 1 can be replaced by productions of lesser 
height. The process can be repeated until there are no productions of height > 1. In 
(**) there may be productions of the type a —)■ 6, but they can be eliminated. Hence 
each production of height 0 may be assumed to be of the type 

a^x {aGN,xGX) (i) 

or of the form 

a ^ a {a G N,a G Sq). (ii) 

A production of height 1 is of the form 

a ^ cr(ri,... ,rm) {m > l,a G T.m,a G N), 

where each r* is a frontier letter, a 0-ary operator or a nonterminal symbol. If is a 
letter from A or a 0-ary operator, then we may substitute a new nonterminal symbol 
d for it and introduce the production d —)■ r* of height 0 without changing the forest 
generated. Thus we may assume that all productions of height 1 are of the form 

a ^ a{ai,... ,am) {m > l,a G T.m,a,ai,... ,am G N). (hi) 
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We say that a regular tree grammar is in normal form if each of its productions is of 
type (i), (ii) or (iii). The previous discussion amounts to the following lemma. 

Lemma 2.3.4 Every regular tree grammar can he transformed into an equivalent regular 
tree grammar in normal form. □ 

Example 2.3.5 None of the productions of the grammar considered in Example 12.3.31 
is in normal form. The production a —> a{x,a{x,b)) can be replaced by the following 
set: 

o —)■ cj(ai, 02), ai^x, 02 —>■ cr(ai, 6 ). 

Notice that we could use the new nonterminal symbol ai twice since in both functions 
it should be rewritten as x. Similarly, the production a —)■ cj(a;, a) is replaced by the two 
productions 

a —)■ ( 7 ( 03 , a) and 03 —>■ tv, 

and the production b —>• a{x,x) is replaced by 6 ^ cr(ai,ai) (we already have ai —)• x). 
We have got a grammar in normal form with five nonterminal symbols a,b,ai,a 2 and 
03 , and the productions 

a —>■ cj(ai, 02 ), a —>■ £ 7 ( 03 , a), b^a{ai,ai), 

ai ^ X, 02 — )• cr(ai, 6 ) and 


□ 

The following minor generalization of regular tree grammars is introduced as a technical 
aid. An extended regular EX-grammar 

G = {N,E,X,P,A') 

is defined otherwise exactly as a regular SX-grammar, but it has a set A' C X of initial 
symbols. Also is defined the same way as for regular tree grammars. The forest 
generated by such a G is 

T{G) = {t € Fs(X) I oo ^ some oq E A'}. 

It is immediately clear that every language generated by an extended regular tree gram¬ 
mar can be generated by an ordinary regular tree grammar, too. 

Theorem 2.3.6 The forests generated by regular tree grammars are exactly the recog¬ 
nizable forests. 

Proof. We associate with every NDF SX-recognizer A = (A, S, X, a. A') an extended 
regular SX-grammar 

G = (A,E,X,P,A'), 
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where 


P = {a—^x\x&X,aG xa} U {a —> u | ct G Sq, a G cr'^jU 

{fl ^ <^(^1 ? • • • ? ttm) I ^ ^ ^ G^,ftG(7' (fti,..., ftm)} • 

The grammar G is in normal form (i.e., the productions are of type (i)-(iii)). It is clear 
that every extended regular SX-grammar in normal form arises this way from a NDF 
SX-recognizer. To prove the theorem it suffices now to show that T(A) = T{G) for 
such an associated pair A and G. To do this we show by tree induction that 

a ^ ta iff a =^q t (*) 

holds for all a G A and t G Fs(A). 

1° For t = xGA, aGxaiffa— 7 >xGPiffa x (here we needed the fact that G 

has no productions of the form a ^ b). 

2° The case t = a G Sq is similar: a G aa iff a G iff a —?> ct G P iff a =^* a. 

3° Let t = (T(ti,... Rrn) (w. > 1) and suppose that (*) holds for ti,... ,tm and all 
states. If a =^* t, then there is a derivation of the form 

a cr(fli, . . . , Om) ^ *^(^1 ) • • • ) lm)i 


where oi,..., G A" and 


Oi =^* U for i = 1,... , m. 

Then a G (T'^(ai,..., Um) by the definition of P, and (*) implies that ai G tia,. .. ,am G 
tm^- Hence, 

a G (tid, • • •, tmd) = ta. 

Conversely, a G ta means that 

a G .. .,am) 


for some oi G fid ,...,Um G tm^- But then (*) implies oi =^* ti,... ,am =^* tm- Also, 
P contains the production a cr{ai ,..., am) and we get the required derivation 

a (t((Zi , . . . , Qm) ^ cr(ti, . . . , tm) — t- 


This completes the proof of (*), and we have for every SA-tree t. 


t G T(A) 


iff 

td n 0 


iff 

a G ta for some 

a G A' 

iff 

a =^Q t for some 

aG A' 

iff 

t G T{G). 



Hence T{A) = T(G) as required. 


□ 
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2.4 OPERATIONS ON FORESTS 

In this section some more insight into the family of recognizable forests is gained by 
studying its closure properties with respect to various forest operations. In the following 
definitions and theorems all forests usually have the same ranked alphabet and the same 
frontier alphabet. To show that this is no serious limitation, we note the following simple 
fact. 

Lemma 2.4.1 Let S and Ll be ranked alphabets such that ECU, and let X and Y be 
frontier alphabets such that X <TY. Then 

Rec{Y,X) =Rec{Q,Y)npFj:{X). □ 

Of course, the lemma presupposes the point of view that every SX-forest is also an 
OT-forest. Now let S and Ll be any ranked alphabets such that H Lin = 0 whenever 
m ^ n. Also, let X and Y be arbitrary frontier alphabets. The lemma implies that if 
S G Rec(E, X) and T G Rec(n, Y), then S and T can be regarded as recognizable forests 
over a common ranked alphabet E U 0 and a common frontier alphabet X UY. 

Theorem 2.4.2 If S,T G Rec(E,X), then SCiT, SUT and S — T are also recognizable 
EX -forests. 

Proof. Suppose S and T are recognized by the EX-recognizers A and B, respectively. 
Let C = Ax B and define 


7 ; X —>■ C* by x {xa,xl3). 


Then 


ty = {ta,tl3) for all t G Fs(X). 


This implies that we get from C and 7 EX-recognizers for S DT, S UT and 5 — T by 
choosing, respectively, as the set of final states A' x B', A' x BcAx B\ and A' x {B — B'). 
For example, let 

C = (C, 7 ,A' X B'). 

For any t G Fs(A), 


t G T(C) iff f 7 = {ta,tl3) £ A’ x B' 
iff fGr(A)nr(B). 


That is, r(C) = SnT. 


□ 


Note that the complement Fj^{X) — T of a recognizable EX-forest T is recognizable. If 
T is recognized by a EX-recognizer A, then the complement is recognized by {A, a, A — 
A'}. 

Definition 2.4.3 Let (T^ | x G X) be an X-indexed family of EX-forests. For each 
EX-tree t we define a forest t{x ^ Tx \ x £ X), mostly written simply t{x ■£- Tx), as 
follows; 
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1° If t = z € X, then t{x ^ T^) = T^. 

2° If t = (T G So, then t{x ^ T^) = a. 

3° If t = (T(fi,..., tm){m > 1), then 

t{x ^ T^) = ... ,Sm) I Si G ti{x ^ Tj,) for i = 1 ,... ,m}. 

The forest product of the family (Tx \ x £ X) with the SX-forest T is defined as the 
SX-forest 

T{x <— Tx \ X £ X) = ^ Tx \ X £ X) \ t £ T). 

We shall usually write just T{x £- Tx). If T consists of a single T,X tree t, then 

T{x £- Tx) = t{x £- Tx). 

The trees t{x ■£- Tx) are obtained from t by replacing every occurrence of each letter x 
by a tree from the corresponding forest Tx. Different occurrences of the same letter x 
may be rewritten as different trees from Tx. 

\f xi,... ,Xn £ X, then we use the notation 

T{xi ^ Ti,... ^ Tn) 

for the forest product T{x £- Tx), where 

r = / for x = Xi {i = 1,... ,n), 

\ X for X ^ {xi,... ,Xn}. 

If the letters xi,..., x^ and their order are understood, then this notation may be further 
simplified to T(Ti,..., T„). 

The comments presented at the beginning of the section show that the definition of 
forest products also includes the cases, where T C Fy,{X) and Tx F F^iY) [x £ X) for 
any such alphabets that n = 0 whenever m ^ n. If T is a SX-forest and the 
forests Tx are DT-forests, then T{x Tx) is a (S U DjT-forest. 

Example 2.4.4 Let S = Sq U S 2 , Sq = {w}, S 2 = {u}, X = {x,y} and Y = {y,z}. If 
t = a{x,a{y,x)), Tx = {a{y,z),z} and Ty = {a{uj,y),a{z, z)}, then t{x ^ Tx,y £- Ty) 
contains eight trees, among them the tree a{a{y,z),a{a{uj,y),z)). □ 

The following special type of forest products is important. 

Definition 2.4.5 Let S and T be SX-forests and z £ X. The z-product of S and T is 
the forest product 

S -^T = T{x ^Tx\x £ X) 
where T^ = S and Tx = x for all x G X, x 7 ^ z. 
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The trees in S -zT are obtained by taking a tree t from T and substituting a tree from 
S for every occurrence of z in t. Different occurrences of 2 : may be replaced by different 
trees from S. 

Theorem 2.4.6 If T £ Rec(S,X) and Tx G Rec(S,X) for all x ^ X, then T{x <— 
Tx) G Rec(S,X). In particular, Rec(S,X) is closed under all x-products {x G X). 

Proof. Here it is convenient to use regular tree grammars. Suppose T and the forests 
Tx {x G X) are generated by the regular SX-grammars G = {N,'E, X, P, oq) and Gx = 
{Nx,'E, X, Px,ax) {x G X), respectively. We may assume that the grammars are in 
normal form and that their sets of nonterminal symbols are pairwise disjoint. Construct 
a regular SX-grammar 

G' = {N',E,X,P’,ao) 
with N' = N U |J(-^x I X G X) and 

P' = P" L) {a ^ Ox \ X & X, a —>■ X G P} U | x G X), 

where P" is P with all productions of the form a— {a £ N,x £ X) deleted. 

We claim that T{G') = T{x £- Tx). The idea is that every derivation oq =^g ■ ■ ■ =^G t 
of a tree t £ T can be imitated by the productions in P" up to the point where frontier 
letters x £ X are to be generated. Instead of generating a leaf x one transfers then by a 
production a —)■ to the beginning of a derivation which generates any tree tx £ Tx in 
place of the leaf. This means that G' can generate all of T(x ■(— Tx). On the other hand, 
every derivation in G' can be brought into this form by rearranging the applications of 
the productions suitably. Hence, T{G') C T(x Tx). For a formal proof it suffices to 
show that 

a p iff (3g G Ft.{X)) a =^*q q, p£ q{x £- Tx) (*) 

holds for all a G and p £ Fs(X). We proceed by tree induction on p. The fact that 
the grammars G and Gx are in normal form is used without comment. 

1° Let p = y £ X. Suppose there is a g G Ft,{X) such that a q and y £ q{x ■£- Tx). 
This is possible only in case q = z and y £ Tz for some z £ X. Then a ^ z £ P 
and hence a ^ Oz, az ^ y £ P'. We get the derivation 

a =^G> a-z =^G> y- 

On the other hand, all derivations of y from a in G' are of this form. Hence, if 
a =^*Q, y, then a —Oz, > y G P' for some z £ X. This means that a ^ z £ P 
and ttz ^ y £ Pz, and thus z is the required tree q. 

2° Let p = a £ Sq- 

(2a) If there is a g such that a =^q q and a £ q{x £- Tx), then there are two 
possibilities. The first one is that q = a. Then P and P' both contain a ^ a 
and we get the required derivation a =^q, a in one step. The other possibility 
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is that q = X ^ X and Px contains ax —> cr. Then a ^ ax and ax ^ cr are in 
P' and we get the derivation 


a =^G' Clx =^G' <7. 

(2b) Suppose a =^q, a. One possibility is that a ^ a G P'. Then a —u is in P, 
too, and we may choose q = a. The only alternative is that the derivation is 
of the form a =^g' o-x =^G' ^ for some x G X. Then a ^ x G P and a G Tx, 
and we may put q = x. 

3° Let p = a{pi,... ,pm) {m > 0). 

(3a) Suppose we have a tree q such that a =^q q and p G q{x -G- Tx). Again 
there are two cases to consider. If q = z G X, then p G Tz, a ^ z G P and 
ttz P- Now a ^ ttz G P' and, since Pz C P', we get 

a =^G' az =>G' P - 


The other possibility is that 


q = a{qi,... ,qm) 

for some qi,... ,qm G Ps(A). Then 

Pi G qi{x ^Tx) (i = 1,..., m) 
and the derivation a =^q q must begin with a step 

a e? (t(q.i,..., azn) 

such that 

Oj =^Q qi for i = 1 ,..., m. 

Our silent inductive assumption yields 

o-i =^G' Pi for i = f,... ,rn. 

Combining these derivations with a -G a{ai ,..., am) G P' we get a =^q, p. 

(3b) Suppose a =^*q, p. This could mean that a —>■ z G P and Uz ^G. P fo^ 
some z G X. Then we may choose q = z. The other possibility is that the 
derivation takes the form 

a =^G' rr(Q,]^,..., Um) ^g' rr(pi, • • •, Pm)■ 

Then there exist SA-trees qi such that 

ai =^*G Qi, Pi G Qi{x ^Tx) (z = 1,..., m). 

Now we may put q = cr{qi,..., qm)- Gl 
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Next we generalize the iteration operation taking the x-products as the starting point. 


Definition 2.4.7 Let T be any SX-forest and let x G X. Put = {x} and 

j'j+^,3: _ j^j,x \j J'Lx 

for all j > 0. Then the x-iteration of T is the SX-forest 

T*x ^ I j > 0). 

The forest T*^ is obtained as follows. First include x. New members of T*^ are 
obtained by substituting in some t £ T for every occurrence of x some tree already 
known to be in T**. Note that = T U x and T^’^ C Ti+i'® for every j > 0. 

Theorem 2.4.8 IfT£ Rec(S,X), then T*^ £ Rec(S,X) for each x £ X. 

Proof. Let G = {N,E,X,P,ao) be a regular tree grammar in normal form generating 
the forest T. Construct an extended regular SX-grammar G' = {N', S, X, P', A'), where 

(1) N' = NC{d} {d^N), 

(2) P' = P U {d ^ x} L) {a ^ r \ a ^ X £ P, uq ^ r £ P}, and 

(3) A' = {ao,(i}. 

It is not hard to see that T{G') = T*^. □ 

The following operation may be seen as a converse to the x-product. 

Definition 2.4.9 Let S and T be SX-forests and let x £ X. The x-quotient of T by S' 
is the forest 

S--T = {p£ Fs(X) I 5 M n T / 0}. 

If S' = {s} is a singleton, then we write S~^T = s~^T. 

A tree p is in S~^T iff one can convert it into a tree in T by substituting for every 
occurrence of x a tree from S'. If S is unary and X = {x}, and if we identify the tree 
iTfc(... (Ti(x)...) with the word ai... ak, then 

S-^T = S-^T = {u£E* \ SunT 

is the usual (left) quotient language. 

Theorem 2.4.10 IfT £ Rec(S,X) and S is any EX-forest, then S is recognizable 
for every x £ X. Moreover, the number of different x-quotients S~^T for any fixed 
T £ Rec(S,X) is finite. 
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Proof. Let A be a SA-recognizer for T. We define an NDF SA-recognizer 

B = (A,/3,A') 

which is identical to A (when states a € A and singleton sets {a} are identified) except 
for the initial assignment which is defined so that 

xj3 = Sa 


and 


zj3 = {za} for all z G X,z ^ x. 


Here Sa is the set of all states sa in which A may be after reading a tree s from S. By 
tree induction one verifies that 


tp = {S -a; t)a 


for all t G Fy,{X). Hence 


t G r(B) iff 
iff 
iff 
iff 


f/3 n A' ^ 0 
{S t)a n A' / 0 
S-^tOT^tb 

t G S-^T 


for all t G Fj](X). This implies S = T(B). The second statement follows from this 
construction as the number of possible /3’s is finite. □ 


Next we introduce the forest operation corresponding to the u-catenation of trees which 
was defined in Section 12.11 

Definition 2.4.11 Let cr G S be an m-ary operator and let Ti,..., be m SA-forests 
for some m > 0. The a-product of the forests Ti,..., Tm is the forest 

a{Ti, . . .,Tm) = . . . ,tm) \tl GTi,. . . ,tm G Tm}. 

If m = 0, then the fi-product is always {cr}. In general, 

(xiTi , . . . , Tm} — , . . . , Xm}} Tf, . . . , Xm Tm}- 

From Theorem 12.4.61 we get the following result which could easily be proved directly, 
too. 

Corollary 2.4.12 If a G 'Em and Ti,..., Tm G Rec(S, A) (m > 0), then (j{Ti ,..., Tm) G 
Rec(S,A). □ 

We shall now consider some operations in which forests are generally transformed into 
forests over another ranked alphabet. The ranked alphabets will be E and fl. Moreover, 
we introduce for every m > 0, a new alphabet 

— {?!) ■ ■ ■ 

which is assumed to be disjoint from all other alphabets. 


69 




2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


Definition 2.4.13 Suppose we are given a mapping 

hx:X^Fn{Y) 


and for each m > 0 a mapping 


hm ■ Fn{Y U 

The tree homomorphism determined by these mappings is the mapping 

h : Fs(X) ^ Fn{Y) 


defined as follows: 

1° h{x) = hx{x) for each x G X. 

2° h{a{ti,.. .,tm)) = /im(cr)(6 ^ h{tm)) for all m > 0, G and 

ti,... ,tm ^ F-e{X). 

The tree homomorphism h is said to be linear if no letter appears more than once in 
for any m > 0 and a G 

To define such an h it obviously suffices to give hx and the mappings hm for which 
Em / 0 - 

Example 2.4.14 Let S = S 2 = { | }, fl = fli U ^ 2 , fli = {^}, LI 2 = {V} and X = Y = 
{x,y}. Define hx and /i 2 by the conditions 

hx{x) = X, hx{y) = y and /i2(|) = V('(Ci)/ (6)). 

If we interpret | as the Sheffer stroke (i.e., the 2-place NAND), V as the symbol of 
disjunction and ' as the symbol of negation, then the tree homomorphism h defined 
by hx and /12 transforms [-expressions in variables x and y into equivalent expressions 
which use V and ' only. If the more customary way to write Boolean expressions is used, 
we get, for example, 


/i((x|y)|(x|x)) = h{x\yy y h{x\xy 

= {x' V y'y V {x' V x'y. 

This tree homomorphism is linear. □ 

Tree homomorphisms are not really homomorphisms in the sense of algebra. The 
concept is the result of the dual nature of words. When one generalizes from languages 
to forests, words are usually treated as unary terms. On the other hand, many concepts in 
language theory arise from the interpretation of words as elements of a free monoid. Here 
the initial concept was that of a homomorphism from the free monoid generated by an 
alphabet E to the free monoid generated by another alphabet Ll. Such a homomorphism 
rewrites every letter in a word over S as a word over Ll. When E and D are now viewed 
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as unary ranked alphabets, this means that every operator from E is rewritten as a piece 
of ri-tree to be combined with other such pieces to form the image of a given S-word. 
The generalization of such mappings to the case of arbitrary ranked alphabets gives tree 
homomorphisms. 

The following example shows that tree homomorphisms do not always preserve recog- 
nizability. 

Example 2.4.15 Put E = Si = {ct}, X = Y = {x} and fl = O 2 = {w}- Define hx and 
hi so that 

hx{x) = X and hi{a) = 

All EX-trees are of the type 

ik = .. a(x) ...)) = a^{x) {k > 0). 

Obviously, h{to) = hx{x) = x and, for all k >0, 

h{tk+i) = Lj{h{tk),h{tk)). 

Thus h{FY:(X)) consists of the trees 

So X, Si Cj(x, x), . . . , Sfc_|_i 'Sfc), • • • • 

Suppose A = A') is an DT-recognizer such that T{A) = /i(Fs(A)). There 

must exist two integers i, j > 0, i 7 ^ j, such that SjO: = Sja. But then 

ti;(sj, Sj)a = a;'^(sjQ;, Sjo) = w^{sia, Sjd) = Sj+id € A' 

would imply u}{si,Sj) € /i(Fs(A)). Thus /i(Ts(A)) cannot be recognizable. □ 

The nonpreservation of recognizability in Example 12.4.151 is due to the ability of the 
tree homomorphism to create arbitrarily large identical subtrees by copying. No tree 
recognizer can check whether trees of unbounded height are identical or not. Such 
copying is precluded by linearity, and the following closure theorem holds. 

Theorem 2.4.16 If h : Fj]{X) —>■ Fq(Y) is a linear tree homomorphism and T G 
Rec(E, A), then h{T) G Rec(D,y). 

Proof. Let G = (A, E, A, P, oq) be a regular tree grammar in normal form generating T. 
We may assume that G has no superfluous nonterminal symbols from which no EA-tree 
can be generated. Let E' and be the ranked alphabets which are obtained by adding 
all nonterminal symbols a G A to E and D, respectively, as nullary operators. We extend 
/i to a tree homomorphism 

h' ■. F^,{X) ^ F^,{Y) 

by continuing /iq to a mapping 


L'o : EoUA^Po'(R) 


71 




2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


so that /io(a) = cl for all a € N. Now let 


G' = (N,n,Y,P',ao) 


be the regular ny-grammar, where 

P' = {a —)■ h'(p) I a —> p G P}, 

i.e., G' is obtained simply by replacing in every production a ^ p (z P the right-hand 
side by the tree h'{p). The theorem follows when we show that T[G') = h(T). This 
again is obvious once we have shown that 

a =^Qi t iff (3s G h{s) = t, a s (*) 

holds for all a G N and t G Fq{Y). We prove the two directions of (*) separately. 

Suppose a t for some a £ N and OT-tree t. We prove the existence of the required 
s by induction on the length of the shortest derivation of t from a. 

1° If t is obtained by a one-step derivation, then P' contains the production a ^ t. 
Then P contains a production a ^ r such that h'{r) = t. If r does not contain 
any nonterminal symbols, we may put s = r. Otherwise we choose for every b £ N 
appearing in r a tree Vb £ Ts(-T) such that b =^q rt- Let s be the tree obtained 
by substituting in r these trees for the corresponding nonterminal symbols. Then 
h{s) = h'{r) = t since h' deletes all nonterminal symbols from r. Moreover, 

a r ^G s, 

and s is the required tree. 

2 ° Suppose now that the derivation consists of k steps {k > 1) and that (*) holds 
whenever a shorter derivation exists. The first step must be the application of a 
production a —)■ h'{p), where a ^ p £ P. Since G is in normal form, 

p — ■ ■ ■ I Clm) 

for some m > 0, a £ Ym and oi,..., G N. The derivation of t can now be 
written in the form 


a =^G' hm (o')(6 ) =^G' ■ ■ ■ =^G' L 


For each (i = 1,... ,m) which is present in hm{o') we have a subderivation 


ai =^G' ■ ■ ■ =^G' L(G Fq{Y)) 

of length less than k. The linearity of h implies that such a appears in hm{(y) 
exactly once, and hence U is unique. For every ti there is an Sj G Fy:{X) such that 
h{si) = ti and ai =^q Si. If a certain does not appear in hmicc), then we choose 
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any Sj G Fs(X) such that a* =^q Si and put U = h{si). With these choices we get 
a tree 

s = cr(si,... ,Sm) G 

such that 


fl C? ■>■■■■! O^m) ^G ) • • • ) 'Sm) — 'S 


and 

/l(s) = hm{cr){il ^ h{si), h{Sm)) = t. 


Now we shall prove the converse part of (*). Suppose a =^q s and /i(s) = t for some 
a € N, s G Fy;{X) and t G Ffi{Y). To show that this implies a =^q, t we proceed by 
induction on the length of the shortest derivation a =^g ■ ■ ■ =^G s. 


1° If there is a derivation of length one, then it consists simply of the application of 
the production a —)■ s. But then a —)■ t is a production of G' and a =^g' t is the 
required derivation. 

2° Suppose now that the derivation is of the form 

a =^G o-(ai,..., Um) =>G • • • =^G <7(si , . . . , S 772 ) — 

where m > 0, a G and oi,..., G N. For every i = 1,... ,m there is a 
shorter derivation 

Qi =^G ■ ■ ■ =^G Si. 

Hence, Ui =^q, h{si) for each i = 1,... ,m. Moreover, P' contains the production 

(I y hfji ('^)(Ci ^ 0,1 1 ■ ■ ■ 1 ^ Q-m) 

corresponding to the production a —< 7 ( 01 ,..., am) of G. Now the required deriva¬ 
tion is 

a ^G' hfYi ) ^G' ■ ■ ■ 

^G' hm{(^){Cl h[si), . . . , ^m h{Sm)) 

= h{s) = t. 

This concludes the proof. □ 


Next we show that arbitrary inverse tree homomorphisms preserve recognizability. We 
need the following technical lemma. Its proof is left as an exercise. 

Lemma 2.4.17 Consider a T.-algebra A and a mapping a : X ^ A, where X O A = tb. 
Let 

a : Fy. {F U yl) —> M 

be the unique homomorphism such that a\X = a and a\A = 1a- Then a|Ts(-^) = d 
and 

P(6 Pk)a = P(6 ^ • • •, 4 ^ Pka)a 

for all k > 0, p G Fy{X U and pi,... ,pk G Fy(X). □ 
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Theorem 2.4.18 Let h : Fs(X) —> Fq{Y) be a tree homomorphism. IfT^ Rec(0,y), 
then h~^{T) G Rec(S,y). 

Proof. Let A = (A, T, a, A') be an Oy-recognizer for T. We construct a EX- 
recognizer B = (A, S, X, /3, A') as follows. For any m > 0, u G Sm and ai,..., G A, 
we put 

CT (oi, . . . , Um) — hm (<7) (^1 i Oi, . . . , i (lrn)o:^ 

where a : Fq{Y U A) —)• A is the homomorphism for which a\X = a and a|A = Ia- In 
the special case m = 0, we get cr® = hQ{a)a = ho{a)a. The initial assignment is defined 
by putting 

xj3 = h{x)a for all x G X. 

Now a proof by tree induction shows that 

S/S = h{s)a 

for all s G Fe(X). Hence, s G r(B) iff h{s) G T(A). This means that h-^{T) = r(B) 
is recognizable. □ 

As a conclusion we consider a simple, but very important special type of tree homo- 
morphisms. 

Definition 2.4.19 A tree homomorphism h : Fy,{X) — Fq{Y) is called alphabetic if 
the defining mappings hx and hm (nr > 0) satisfy the following conditions: 

(1) hx{x) G y for all x € X. 

(2) hm{cr) = w(^i,... ,^m), where to G Om, for all m > 0, cr G Sm- 

An alphabetic tree homomorphism Fy,{X) — Fq{Y) can be defined only in case Dm 7^ 
0 for all such m > 0 that Em 7 ^ 0 . Alphabetic tree homomorphisms are often called 
projections. 

Consider the general alphabetic tree homomorphism of the definition. For any t G 
Fs(X), the image h{t) is obtained simply by rewriting every x in t as the letter hx{x) 
and every a G Em as the operator w, where hmicr) = ... ,Cm)- Hence h preserves 

completely the “shape” of the tree t. Obviously, h is linear. From Theorems 12.4.161 and 
12.4.181 we get 

Corollary 2.4.20 Let h : Fj](X) —>■ F^(Y) be an alphabetic tree homomorphism. 

(i) IfT £ Rec(S,X), then h{T) G Rec(0,y). 

(ii) IfT £ Rec(H,y), then h-^{T) £ Rec(S,X). □ 
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2.5 REGULAR EXPRESSIONS. KLEENE’S THEOREM 

Kleene’s theorem is of central importance in the theory of finite automata and it is 
quite natural that it was among the first results to be generalized to the theory of 
tree automata. Although the greater generality adds some technical complications, the 
standard development of the theory can be followed quite close here, too, once the right 
generalizations of the basic concepts have been found. 

We fix again an arbitrary ranked alphabet S and an arbitrary frontier alphabet X. It 
turns out that some additional frontier symbols are needed in the construction of regular 
forests. Therefore we will operate with an extended alphabet Z which contains A as a 
subset. 

Definition 2.5.1 The set of regular HZ-expressions RE(E, Z) is defined as the smallest 
set RE such that the following conditions are satisfied: 

1° 0 G RE. 

2° HqUZ C RE. 

3° If C, ?? G RE, then (C + ??) G RE. 

4° If C, ?? G RE and z € Z, then {( rj) G RE. 

5° If C G RE and z G Z, then (C*^) G RE. 

6° If m > 0, a G rji,... ,r]m G RE, then a{rii,... ,r]m) G RE. 

Thus regular SA-expressions are strings of symbols from S U A, of commas etc. Parts 
2° and 6° of the definition imply that every HZ-iree is a regular SA-expression. Regular 
expressions are intended as representations of forests. 

Definition 2.5.2 The forest \ri\ represented by a regular expression r] G RE(S,A) is 
defined following the inductive form of Definition 12.5.II 

1° |0| = 0 (the empty forest). 

2° If r/ G So U Z, then \r]\ = {r]}. 

3° |(C + f?)l = ICI U |??|. 

4° \{C-zV)\ = ICI \ri\- 

50 ^ 

6° \a{r]i,...,Tjm)\ =cr(|r/i|,...,|T/m|)- 
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Note that the operations in the right-hand sides of 3° — 6 ° are forest operations which 
have been defined in Section [231 It is easy to see that every tree t G F-£[Z) represents, 
as a regular expression, the one-element forest {t}. 

With this interpretation in mind we may simplify regular expressions by omitting 
parentheses that are not needed in order to specify the intended order of the opera¬ 
tions. First of all, the outermost parentheses in ((" -|- rj), ((" r]) and (C*^) are obviously 

superfluous if the expressions do not appear as parts of other expressions. We may 
also agree that iterations precede products and that products precede unions. Then the 
parentheses around can always be omitted and, for example. 


C + h -x o*y 


is interpreted as a short form for 


(C + (h r^)))- 


Example 2.5.3 Let S = Sq U Tj 2 , Sq = {cu} and E 2 = {u} and Z = {x,y}. The forest 
represented by 

r] = u} -y a{x,y)*y 

contains the trees ui, a{x,uj), a{x,a{x,uj)) etc. Note that y has a purely auxiliary 
function; it does not appear in any tree of the forest \r]\. □ 

In the following dehnition we make the formal distinction between letters that may 
appear in trees of the forest represented by a regular expression and those letters that 
are used just to mark leaves to be rewritten when products of forests are formed. 

Definition 2.5.4 Suppose a regular SZ-expression can be written in the form 

C = u{r] e)v 

where r],9 & RE(S,Z) and z a Z. Then every occurrence of z within the string -^9 is 
said to be bound. An occurrence of a letter z a Z which is not bound is free. A letter 
z G Z is bound in ( G RE(S, Z), if all occurrences of 2 ; in ^ are bound, and it is free in ( 
if it has at least one free occurrence in f. We denote by Z^ the set of letters z ^ Z free 
in 

In Example 12.5.31 Zy = {x} and y is bound by the y-product. 

Lemma 2.5.5 For any rj G RE(S,Z), |r/| G Rec(S,Zj^). 

Proof. We proceed by induction following the six parts in Dehnitions 12.5.11 and 12.5.21 
1° Z 0 = 0 and |0| = 0 G Rec(S,0). 

2° For each z £ Z, Z^ = {z} and | 2 ;| = {z} G Rec(S, { 2 ;}). For a G Sq, Z^- = 0, but 
still |(t| = {u} G Rec(S,0). 
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3° If rj = ( + 0, then U Zq and \r]\ = |C| U |0| G Rec(S, Z^j) by Lemma r2.4.1l 

and Theorem 12.4.21 

4° If 7] = C -z 0, then (if we omit the trivial case z 0 Zg, |r/| = |0|) Z^ = Z^ U {Zg — z). 
There are two cases to consider. If z ^ Z^, then Z^ = (Z^UZg) — z. Prom Theorem 
12.4.61 we know that \r]\ G Rec(S, Z^ U Zg). Thus it suffices to show that no tree 
t G \r]\ contains any occurrence of z. But this is obvious since every such t is 
obtained from some s G |0| by replacing every occurrence of by a tree from Id, 
and no tree in |d contains z. If 2 ; G Z^, then Z^ = Z^ U Zg and \r]\ G Rec(S, Z^) 
follows directly from Theorem 12.4.61 

5° If rj = {z G Z), then Z^ = Z^Uz. Thus |C| G Rec(S,Z^) by Lemma r2.4.11 This 
implies G Rec{T,, Z^^) by Theorem 12.4.81 

6° If r] = cr{r]i ,..., Pm), where m > 0, cr G and \r]i\ G i?ec(S, ZrjJ (i = 1,... , m), 
then Zfj = Z^^ U.. -UZ^^ and every \r]i\ is also a recognizable SZ^-forest. Corollary 
12.4.121 yields now \r]\ G Rec(S,Z^). □ 


The operations (finite) union, z-product and z-iteration are called the regular opera¬ 
tions. A forest is regular if it can be constructed from finite forests by applying a finite 
number of regular operations. In view of the preceding discussion regularity can also be 
dehned as follows: 

Definition 2.5.6 A SX-forest T is regular if there exist an alphabet Z {X C Z) and a 
regular SZ-expression rj such that jr/j = T. 

Note that an unlimited number of auxiliary letters (z € Z — X) is allowed in a regu¬ 
lar expression representing a regular forest, but that in any particular case just a finite 
number of them are needed. Lemma 12.5.51 implies now that all regular forests are recog¬ 
nizable. The next lemma contains the converse statement. 

Lemma 2.5.7 For any T,X-recognizer A one can construct a regular expression rj G 
RE(S, X U A) (we assume A n A = such that \ri\ = T(A). 

Proof. The proof is modelled after the almost standard proof for the corresponding fact 
in the language case (due to R. McNaughton and H. Yamada (I960)). The notation can 
be simplified by assuming that 

A = {1, 2 , ... ,k} for some A; > 1. 

As in Lemma 12.4.171 let 

a : J~Y.(yX U A) —y A 

be the homomorphism such that a\X = a and a\A = 1^. For any i ^ A, K <Z A and h, 
0 < h < k, we denote by T{K, h, i) the set of all f G Fe(A U K) such that 

(1) ta = i and 
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(2) sa G {1,... , /i} for all s G sub(f) — (X U Sq U t). 

Thus t G T(K, h, i) means that the leaves of t may be labelled, besides frontier letters 
and nullary symbols, by states from K. Moreover, the computation of A on t results in 
state i and the state of A at any node between the frontier and the root is in the set 
{1,..., /i}. Obviously, 

T(A) = lJ(r(0,A:,i) liGA'). 

It suffices therefore to show that all sets T{K,h,i) are regular. To do this we proceed 
by induction on the number h. 

1° When h = 0, no intermediate states between the frontier and the root are allowed. 
Every tree t in T{K, 0, i) must hence be of one of the following types: 

(i) t = X € X and xa = i. 

(ii) t = i ^ K. 

(iii) t = a ^ with = i. 

(iv) t = a{di, ..., dm) with m > 0, dj € X U T,q U K (j = 1,..., m) and ta = i. 

In each case a regular expression for {t} can be written. The number of such trees 
t is finite and we get a regular expression for T(K,0,i)- 

2° Suppose we already have a regular expression for each T{K,j,i) such that j < h 
for some h < k. We show that 


T{K,h + io)= n 

T{K, h, i) U T{K, h,h + l) -h+i T{K Uh + l,h,h + -^+1 T{K Uh + l,h,i) 

holds for all A C A and i & A. This will complete the induction because the 
right-hand side of (*) is obtained by regular operations from forests for which we 
already have regular expressions. 

Let T be the right-hand side of (*). From the construction of T it is obvious that 
T C T{K, h + 1, i). If t G T{K, h + 1, i), then either t G T{K, h, i) or t has a proper 
subtree s 0 X U Sq such that sa = h + 1. In the former case we get t G T directly. 
In the second case we have 

^ € {p \, ■ ■ ■ , Pd} 'h+l {o'!!) • • • ) Qlei } 'h+1 ■ ■ ■ 'h+1 {djl^ ■ ■ ■ ) Qjej } 'h+1 {^}) 
for some 

Pi,--- ,Pd € T{K,h,h + . -,qiei,- - -,qji,-- - ,qjej € T{K Ch + l,h,h + T) 

and r G T{K U /i -|- 1, h, i). This means that t belongs to the second part of T. □ 


Combining Lemma 12.5.51 and Lemma 12.5.71 we get the following generalized form of 
Kleene’s theorem. 

Theorem 2.5.8 A forest is recognizable iff it is regular. □ 
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2.6 MINIMAL TREE RECOGNIZERS 

The number of states is a simple and natural measure of the complexity of a finite 
automaton. In this section we consider minimal recognizers of forests. In the case of a 
recognizable forest minimality means simply a minimal number of states, and there is 
always a minimal recognizer which is unique up to isomorphism. All tree recognizers 
recognizing a nonregular forest must be infinite and counting the number of states does 
not make any sense. Nevertheless, the general definition of minimality is such that 
the minimal recognizer of a forest remains unique even in such a case. The minimal 
recognizer of a forest can be derived from any recognizer of this forest. If the forest is 
recognizable, then the minimalization procedure is effective. Otherwise, the finiteness 
of the recognizers is not needed in this section. Also, some of the concepts and results 
presented here will be applied to infinite tree recognizers in the next section. Thus we 
will temporarily drop our general assumption that all tree recognizers dealt with are 
finite. In all other respects the earlier definitions and conventions remain valid. 

We shall now define homomorphisms, congruences and quotients of tree recognizers. 
The reader may find it helpful to review the corresponding material from Section 11.21 
before going on. Algebraic functions and elementary translations (cf. Sect. 11.31) will also 
be needed. 

Definition 2.6.1 A homomorphism from a SA-recognizer A to a SA-recognizer B is 
a mapping (p : A ^ B such that 

(1) (/j is a homomorphism from the S-algebra A to the S-algebra B, 

(2) ap = I3, and 

(3) B'p-^ = A'. 

If is a homomorphism from A to B, we write p : A ^ B. A homomorphism of tree 
recognizers is an epimorphism if it is surjective, a monomorphism if it is injective, and it is 
called an isomorphism if it is bijective. If there exists an isomorphism p : A ^ B, then 
we write A = B and say that A and B are isomorphic. If there exists an epimorphism 

: A —>■ B, then B is said to be an epimorphic image of A. A monomorphism is also 
called an embedding. 

Part (3) of Definition 12.6.11 means that the hnal states, and these only, map to hnal 
states in a homomorphism. If p is an epimorphism, then (3) implies A'p = B'. 

Lemma 2.6.2 Let A and B be two YIX-recognizers. If there exists a homomorphism 
p : A ^ B, then T(A) = T(B). 


Proof. The clauses (1) and (2) of Definition [2j6T] imply together with Lemma fl.3.6l that 

t'^{a)p = t’^{ap) = t^(/3) 
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for every t € Fy:{X). Now clause (3) shows that 

t € T(B) iff t^(/3) = G B' 

iff t'^(a) € A' 
iff t G r(A) 

for every t G Fj]{X), and the lemma follows. □ 

Definition 2.6.3 A congruence of a SA-recognizer A is a congruence g of the algebra 
A saturating A', that is, such that A'g = A'. The set of all congruence relations of A is 
denoted by C(A). 

Lemma 2.6.4 C'(A) is a principal ideal of the complete lattice C{A), and thus (C'(A), 
C) is a complete lattice itself, too. 

Proof. It suffices to verify the following simple facts: 

(i) 6a G C'(A) (which implies C'(A) A 0)- 

(ii) 9 C g ^ C{A) and 6 G C{A) imply 6 G C'(A). 

(hi) yig\ge C(A)) g C(A). 

In (iii) the supremum is to be formed in C(A). It is the generating element of the 
principal ideal. □ 

In Theorem 12.6.lUI we shall get a more useful description of the greatest element of 
C(A). 

Definition 2.6.5 The quotient EX-recognizer of a SA-recognizer A with respect to a 
congruence g is the SA-recognizer 

A/g = {A/g,ag,A'/g), 

where is defined so that xog = {xa)g for each x G A. 

The usual relations between homomorphisms, congruences and quotients hold for tree 
recognizers, too. Some of them are listed in the following theorem. We omit the proofs 
since they can be constructed exactly as the corresponding proofs in algebra. 

Theorem 2.6.6 (a) // ^? G C'(A), the natural mapping 

g^ ■. A ^ A/g, ag (a G A), 

is an epimorphism A —>■ A/g (called the natural epimorphismj. 
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(b) // y? : A — >■ B is a homomorphism, then the kernel cpip ^ is a congruence of A and 
the image 

Aip = {Aip, /?, AV) 

of A is isomorphic to A(pip~^. (In Aip Aip is the Tj-algebra {A(p,Tj) such that 
= o^\Ag> and fi is to he interpreted as a mapping from X to Aip.) 

(c) If TT C Q for some tt,q € C{A), then A/g is an epimorphic image 0 /A/tt. □ 
From Theorem 12.6.61 and Lemma 12.6.21 we get 

Corollary 2.6.7 If g & C1{A), then T{A/g) = T{A). □ 

Thus any congruence of a tree recognizer yields an equivalent recognizer which is an 
epimorphic image of the original one. If the recognizer is finite and the congruence is 
nontrivial, then a real reduction in the number of states is achieved. Obviously, the 
greatest congruence gives the smallest quotient recognizer. The construction of the 
quotient recognizer involves a merging of states which are equivalent in the sense that 
one can be substituted for another in any computation without affecting the end result. 
We shall now give a precise meaning to this equivalence of states and show then that 
the greatest congruence consists exactly of the pairs of equivalent states. 

Definition 2.6.8 Two states a and 6 of a SX-recognizer A are said to be equivalent 
and we write a ~a b or just a ~ 6, iff 

(V/GAlgi(A)) f{a)€A'-^f{b)€A'. 

To get a better intuitive grasp of this definition we recall the fact that for each algebraic 
function / € Alg]^(A) there exists a tree f € Fs(A U such that for all a € A, 

f (®) — tdia, 

where Oa : Au.^ ^ A is defined by aa\A = 1a and = a fLemma ll.3.141) . This means 
that A computes /(a) from the tree t when one assigns state a to all leaves labelled 
by f. On the other hand, every tree t € Ts(A U f) defines this way a unary algebraic 
function. Such a tree may be thought of as the unprocessed part of a SX-tree where a 
leaf labelled by a state c € A corresponds to a subtree s such that sd = c. Once a value 
a € A has been assigned to the leaves labelled by the computation may be completed. 
The equivalence of two states a and b means that the assignments f = a and f = b give 
always the same result (mod A') when such a computation is completed. 

Definition 2.6.9 The SX-recognizer A is 

(a) reduced if ~a= <5a, 

(b) connected if every state of A is reachable, i.e., there exists for every a € A a tree 
t G Fs(X) such that ta = a, and A is 
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(c) minimal if it is connected and reduced. 

That a recognizer is reduced means that no two distinct states are equivalent. To 
be connected means that every state is possible in some computation performed by 
the recognizer on some tree. By Lemma 11.3.81 a tree recognizer A is connected iff 
Xa generates A. In the case of a finite recognizer minimality really means a minimal 
number of states among equivalent recognizers. If a recognizer is not connected, then 
the nonreachable states can be discarded without changing the forest recognized. If A is 
finite and ~a> <^A 5 then A/~a is a properly smaller recognizer equivalent to A. Hence, 
a finite tree recognizer can be minimal with respect to the number of states only if it is 
minimal in the sense of Dehnition 12.6.91 The converse will be established later. 

Theorem 2.6.10 For any EX-recognizer A, ~ is the greatest congruence of A and 
A/~ is a reduced EX-recognizer equivalent to A. 

Proof. It is obvious that ~ is an equivalence relation on A. Let a ^ b (a, b & A). For 
any two unary algebraic functions f,g€ Alg^(A), the composition 

fog:^ hAfi((/(0) (? G A) 

is a unary algebraic function. Hence 

gif (a)) € A' iff g(f(b)) G A, 

and this implies f(a) ~ f(b). By Lemma [1.3.161 ~ is a congruence of A. If a ~ 6 and 
a G A', then b = Ia(^) G A'. Thus A'~= A' and ~ is a congruence of A. Let g be 
any congruence of A. If agb and / G Alg]^(A), then g G C'(A) implies fia)gf{b). Now 
A' g = A' implies 

/(a)GA iff /(6 )gA. 

Hence a ^ b and we have shown that ~ is the greatest among the congruences of A. 
Corollary 12.6.71 tells us that r(A) = T(A/~). That A/~ is reduced follows directly 
from the fact, well-known in universal algebra, that the lattice C'(A/~) is isomorphic to 
the principal dual ideal [~) generated by ~ in C(A). Since ~ is the greatest element of 
C'(A), [~) is trivial and thus ~a/~ must be the diagonal relation of A/~. A more direct 
proof is possible, too. It is not hard to show that (a jr^ ( b ) implies a I'N-' b, and 

hence a ~= 6 ~. □ 

The quotient recognizer A/~a is often called the reduced form of A. It is clear 
from Theorem 12.6.101 that two tree recognizers having isomorphic reduced forms are 
equivalent. We show that the converse holds for connected recognizers. In other words, 
equivalent minimal recognizers are shown to be isomorphic. 

Theorem 2.6.11 Let A and B be two minimal tree recognizers. If A and B are equiv¬ 
alent, then they are also isomorphic. 
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Proof. Define ip : A ^ B so that 

{ta)ip = t/3 for all t G Fy:{X). 

We show that (p gives the required isomorphism from A to B. This involves the following 
seven points: 

(i) ip associates with every a € A a state of B since A is connected. 

(ii) To show that ip is well-defined we consider the possibility that sa = ta for two 
SA-trees s and t. If s/3 ^ t/3, then s/3 and t/3 are nonequivalent and there exists 
an algebraic function / G Alg]^(;B) such that f{s$) G B' and f{t$) 0 B' (or 
conversely). By Lemma 11.3.141 there exists a tree p G Fy,{B U 0 B U A) such 
that for all 6 G B, 

f{b)=p^m, 

where /3b : B U ^ ^ B is dehned so that /3b\B = 1b and ^/3b = b. Since B is 
connected there exists for each 5 G B a SA-tree pb such that pb/3 = b. Let 

q=p{h^Pb\heB){£ Fe(A U i)). 

Consider the SA-trees qs = q{£, ^ s) and qt = q{^ ^ t). Now 

qJ=p^{/3^^) = f{sP)eB' 

and 

qtP=P^{P,^) = fm^B'. 

If we assign in q to every letter x G A the value xa, we get a function g G Alg]^(A) 
such that for each a G A, 

g{a) = q^{aa) 

where Oa : AU^ —A is defined so that aa\X = a and ^aa = a. Applying Lemma 
11.3.61 we get now 

g{sa)ip = q-^{asa)ip = qsop = QsP € B' 

and 

g{ta)ip = q-^{ata)ip = qta<p = qS ^ B'. 

This is in contradiction with our original assumption that sot = ta. Hence qs G 
r(B), but qt ^ T'(B). On the other hand, sa = ta implies qsOt = qta, and a 
contradiction with our assumption that T{A) = r(B) results. 

(iii) Reversing the roles of A and B in Part (ii) one sees that s/3 = t/3 implies sa = ta 
for all SA-trees s and t. This means that ip is injective. 

(iv) ip is surjective since B is connected. 
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(v) Let m > 0, a £ and ai,...,Om € A. There are trees ti,... ,tm £ Fy,{X) such 
thnt ci\ — ^ ^TfL — ^TTL^' d hen 




.. .,tma)^p 

(T(ti ,..., tfYi)a,(p 
a{ti, . . .,tm)P 

cr®(ti/3, . . .,tmh 

(7 (^CLiip, • • • ) 


Hence is a homomorphism from A to B. 

(vi) For each x £ X, xa^p = xaip = x/3 = xfl. Thus aip = /3. 

(vii) If ta £ A' {t £ Fj^{X)), then taip = £ B' since t £ T{A) = r(B). Similarly, 

ta(p £ B' implies ta £ A'. Hence, B'ip~^ = A'. □ 


Corollary 2.6.12 If A and B are connected EX-recognizers such that T(A) = r(B), 
then A/~a— B/~b. □ 


For every SA-forest T there is at least the inhnite SA-recognizer 

Ft = (-Fs(A),lx,r) 

where As(A) = (Fs(A),S) is the SA-term algebra. Indeed, for each t £ Fy,{X) we 
have 

t € r(Fr) iff t£T. 

Obviously Fy is connected. Hence, Et/^ is a minimal recognizer for T (the relation ~ 
will be examined more closely in the next section). To show it we shall verify that every 
quotient recognizer of a connected tree recognizer is connected. 

Let y? : A —>■ B be an epimorphism of SA-recognizers. If A is connected, then so is 
B. Indeed, let b be any state of B. There exists an a € A such that a(p = b. Since A is 
connected there is a tree t £ Fj^{X) so that a = t-^{a). Using Lemma fl. 3.6 1 we get 

t^(/3) = t^{aip) = t-^{a)(p = apt = b. 

In particular, A/~a is connected for every tree recognizer A. 

We now have everything needed for the main theorem of the section. 

Theorem 2.6.13 For every forest T there exists a minimal tree recognizer, and it is 
unique up to isomorphism. If A is any connected recognizer of T, then the minimal 
recognizer is an epimorphic image of A. In fact, A/~a is minimal. □ 

The theorem is valid for every forest. It suggests the following two-step procedure for 
finding the minimal recognizer for T once any recognizer A of T is given: 
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1° Discard all nonreachable states from A. We get a connected recognizer B such 
that r(B) = T. 

2° Reduce B by finding ~b and then constructing B/~b which is the required mini¬ 
mal recognizer. 

Both of these steps become effective when T is a recognizable forest and the given 
recognizer A is finite. 

The reachable states of A form the subalgebra of A generated by the subset Xa. This 
can be found as follows. Let Hq = Xa U {a-^ \ a G Sq} and put 

ffi+l — Hi U {(T (oi, . . . , Um) | > 0, iT G j j ■ ■ ■ ; 0,m £ H{\. 

Then 

Ho Hi C ... C A 

and Hi = [Xa] {i > 0) if Lfj+i = Hi. Such an i must exist since A is finite. 

Suppose now that we have a finite connected SA-recognizer B and consider step 2°. 
First one should find Alg]^(;B). It is finite and can be formed repeating the inductive step 
of Definition 1 1.3.131 a finite number of times. Then ~b can be determined directly, using 
the definition. Although the minimal recognizer B/~b certainly can be found this way, 
the procedure would be quite tedious in most cases. A computationally simpler method 
can be derived from the following lemma. The proof is left as an exercise. The crucial 
aid is Lemma ll.3.161 an equivalence is a congruence iff it is invariant with respect to all 
elementary translations. 

Lemma 2.6.14 Define a descending sequence ■ of equivalences on B as 

follows: (i) i?/~o= {B', B — B'} and (ii) for all i > 0 and a,b ^ B, a ~i+i 6 a 6 
and f{a) f{b) for all f G ET(R). Then ~j=~B if o,nd this holds for some 

i<\B\. □ 

2.7 ALGEBRAIC CHARACTERIZATIONS OF 
RECOGNIZABILITY 

In this section two strictly algebraic characterizations of the recognizable forests are 
presented. First some ideas from the previous section are applied to derive a generaliza¬ 
tion of Nerode’s theorem on regular languages and right congruences of the free monoid 
(cf. Theorem ll.5.6p . Then we show that the recognizable forests can be obtained by 
solving fixed-point equations of a certain kind. Again, there is a well-known precursor 
in the theory of finite automata. In fact, in the unary case the equations considered here 
reduce to Arden’s equations which give the regular languages as their solutions. 

Let S and X be fixed and denote the SA-term algebra As (A) by A, for short. In 
the previous section we noted that each SA-forest T has the (infinite) SA-recognizer 
Ft- = (A, ljv,A). Consider any SA-recognizer A such that T(A) = T. It is easy to 
verify that the extension of the initial assignment a: A —>■ A to a homomorphism 

d: A ^ A 
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is also a homomorphism of S2f-recognizers from Fj’ to A. Indeed, = d and 

A'a~^ = T{A) = T. The kernel aa~^ is a congruence of Ft with a congruence class 
for each reachable state of A. If T is recognizable, A may be chosen as finite, and then 
aa~^ is of finite index. Now, suppose Ft has a congruence g of finite index. Then 
Ft/^> is a finite SA-recognizer such that T{Ft/q) = T{Ft) = T (by Corollary I2.6.7p . 
Hence T is recognizable. The congruences of Ft are simply the congruences of J- which 
saturate T. Among these there is one of hnite index iff the greatest congruence 
of Ft is of finite index. The congruence for short) is the Nerode congruence 

of T. These observations may be summed up as 

Theorem 2.7.1 For every EX-forest T the following three conditions are equivalent: 
(i) T € Rec(S,A). 

(a) The term algebra As (A) has a congruence of finite index which saturates T. 

(Hi) The index of the Nerode congruence ~t is finite. □ 

The recognizer Ft is connected and Theorem 12.6.lOI implies therefore that Ft/ ~t is 
the minimal recognizer of the forest T. To hnd ~t for a given SA-forest T one could 
try to apply Dehnition 12.6.81 to Ft: for any s,t G Fs(A), 

s ~T t iff (Vp € Fs(A U ^)) (p(^ ^ s) G T ^ t) G T) . 

A part of Theorem 12. 7. II can be restated as follows. 

Corollary 2.7.2 A EX-forest T is recognizable iff there exist a finite E-algebra A, a 
homomorphism ip: As (A) A and a subset A' C A such that T = A'p~^. □ 

The corollary gives, in fact, just an obvious reformulation of the definition of recog- 
nizability. Without going into the subject any further here, we note that in this form 
recognizability may be defined for subsets of arbitrary algebras (and not just term al¬ 
gebras): a subset T of a S-algebra A is said to be recognizable, if there exist a finite 
S-algebra B, a homomorphism p: A ^ B and a subset H C B such that = T. 

If here A = As (A), then we get the recognizable SA-forests, and if A is the free 
monoid A*, then we get the recognizable A-languages. 

As an introduction to the theory of fixed-point equations we first look at an example 
of Arden equations. 

Example 2.7.3 Consider the two-state Rabin-Scott recognizer A defined by the state 
graph shown in Fig. 12.61 The input alphabet is S = {a, r}. 

Let Li and L 2 be the languages of all words taking A from the initial state 1 to state 
1 and 2, respectively. Then the following equations hold: 

Li = Lia 0 L 2 (t O e 
L 2 = Lit U L 2 T . 
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Figure 2.6. 


If we define a mapping 

fl: ^ 

so that for all U,V C S*, 

fl{U, V) = {Ua UVaUe,UTU Vr) , 

then ([T|) means that (Li,L 2 ) is a solution of the fixed-point equation 

(ui,U2) = n(ui,U2) . (2) 

Moreover, (Li,L 2 ) is the least solution of ([2]) when (pS*)^ is partially ordered in the 
natural way: 

(UuVi) < {U2,V2) iff U1CU2 and Vi C V2 . 

If we view S as a unary ranked alphabet and identify S{x}-trees and S-words as shown 
in Section [22] (x = e, ak{- ■ ■ ai{x) ■ ■ ■) = cJi • • • Ok), then the term algebra may 

be taken to be 

J-=(E*,S) , 

where 

cr'^(u) = ua (cr G S, u G S*) . 

In the corresponding subset algebra 

p.F=(pS*,S) 


we have the operations 

= La (o- G S, L C S*) . 

The mapping II can be dehned in terms of these operations, the empty word and unions: 

fl{U, V) = {a’f’^iU) U a^^{V) U x, rP^(C/) U rP-^(F)) . 

Using forest products we may write this as follows: 

fl(U, V) = ({o-(ui), a{v 2 ), x}(ui ^U,V 2 ^V) , 

{r(ui), r(u 2 )}(ui <^U,V2^ V)) . 

Finally, we write ([2|) in the more readable form 

vi = a{vi) + a{v2) + X 

V2 = t{vi) + t(v2) 

as a system of equations to be solved in the forest algebra pJ^ which is augmented by 
union as an operation. Union is denoted here by I-. □ 
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It is obvious that Example 12.7.31 could be repeated for any regular language and that 
the language itself is always the union of those components of the minimal fixed-point 
which correspond to final states. The interpretation of the equations in terms of forest 
operations serves as the starting point for a generalization to equations for regular forests. 
Fix again a ranked alphabet S and a frontier alphabet X. For any A; > 1, let 

Fk = {pFj:{X)f 

be the set of /c-tuples of SX-forests. We order Fj. partially by componentwise inclusion: 

{Su...,Sk)<{n,...,n) iff 5icri,...,SfccTfc . 

Then F^ becomes a complete lattice in which least upper bounds and greatest lower 
bounds are obtained, respectively, by forming componentwise unions and intersections, 
thus 



• ; 1 

ie 1 ) = 

= (]J{Sii\ieI),.. 



^■>Sik) 1 

i€ I) = 

= (f]{Sii\i€l),.. 



The least element is 0 = (0,..., 0). (We refer the reader to Section 11.41 for the lattice 
theory needed here.) 

Let 14 = {^^ 1 ; • • •) Vk} be a set of variables disjoint from S and X. With every T,{X U 
I4)-forest P we associate the mapping 

P: Fk^pF^iX) 


dehned so that 

P(Ti, ..., Tk) = P(vi Ti,... ,Vk <— Tk) 

for all (Ti,..., Tfc) G F^. A /c-tuple 11 = (Pi,..., P^) of finite S(A U I4)-forests is called 
a {T,X,k)-polynomial and we associate with it the mapping 

n: Fk ^ Fk 


dehned so that 

n(T) = (Pi(T),...,A(T)) (TGPfc). 

Lemma 2.7.4 For any {E,X,k)-polynomial II, the mapping II: Fk —>■ Fk is u-continuous. 
Proof. Let LI = (Pi,..., P^). The mapping 11 is isotone as 


P{vi ^ Si,... ,Vk Sk) F P{vi ^Ti,...,Vk^ Tk) 




2.7 Algebraic characterizations of recognizability 


obviously holds for all P C Fs(X U I 4 ) and SX-forests Si,..., Sk,Ti,... ,Tk such that 
Si^Ti,...,Sk^ Tk. Let 

To C Ti C T2 C ... 

be any ascending cu-sequence of vectors 

Tj = {Til, ■ ■ ■, Tik) G Pfc (* > 0) 

of SX-forests. Now write 

T= (|J(r*i M>o),...,lJ(r,fe |i>o)) . 

In order to prove cu-continuity we should show that 

n(T) = ( 1 J(A(T,) |i>0),...,lJ(Pfc(T,) |i>0)) , 

or equivalently, that 

P,(T) = lJ(^,(TO|i>0) {j = l,...,k). (5) 

Every tree t G Pj(T) is obtained from some p G Pj by substituting a tree from |J(rim | 
i > 0) for every occurrence of each variable Vm and each m = 1,... ,k. The number of 
occurrences of variables in p is finite. Hence there exists an i > 0 such that all trees used 
in this substitution appear in a component of Tj. Then t G Pj(Tj). This shows that the 
left side of ([5]) is included in the right side of ([5]) for each j = 1,... ,k. The converse 
inclusions are obvious since 11 is isotone and Tj < T for alH > 0. □ 

Now, using Theorem 11.4.81 we get 

Corollary 2.7.5 For any {T, X,k)-polynomial H, the mapping H: Pfc —>■ Fk has the 
least fixed-point 


[H] = \/(on‘ M > 0) 


□ 


The corollary means that [H] is the least solution of the fixed-point equation 

(ui, ...,Vk) = n(ui, ...,Vk) , (6) 


where the Uj’s are “unknowns” that assume SX-forests as their values. The equation ([6]) 
can also be written as a system of equations 


ui = Pi 


(7) 


Vk = Pk , 


89 





2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


where the P’s are usually expressed as formal sums of their elements (as we did in 
Example I2.7..sp . 

The finiteness of the components P* was not used in the proof of Lemma I2.7.41 How¬ 
ever, it will be essential for obtaining the main result of this section. In fact, it will 
be convenient, although not necessary, to work with an even more restricted class of 
fixed-point equations, which we shall soon introduce. Example 12.7.31 provides us with a 
guideline here, too. 

Let us extend the height function of Fy,{X) to Fy,{X U I 4 ) so that 

hg(uj) = -1 = ,k) . 

Then the S(X U I4)-trees of height 0 are 

(i) the frontier letters x € X, 

(ii) the 0-ary operators cr € Sq, and 

(hi) the trees of the form cr(uq,..., where m > 0, cr G and uq,..., Vi^ G I 4 . 

Definition 2.7.6 A (S, X, A;) -polynomial H = (Pi,..., P^) is regular, if every S(XUI4)- 
tree of height 0 belongs to exactly one Pj, and the Pj’s do not contain any other trees. 
If n is regular, then H and the corresponding fixed-point equation Q are also said to 
be regular. A SX-forest T is called equational if it can be expressed as the union of 
some components of the least solution of a regular fixed-point equation. 

The fixed-point equation in Example 12.7.31 is regular. It is easy to see that the same 
procedure applied to any Rabin-Scott recognizer will yield a regular fixed-point equation. 
Hence, every regular language is equational when viewed as a unary forest. It is also 
well-known, and easy to prove, that the components of the least solution of a system of 
Arden equations are regular. 

Example 2.7.7 Let S = Sq U E 2 , Sq = { 7 }, E 2 = {u} and X = {x,y}. Then 

H = ({x, 7 , a{vi,V 2 ), a{v 2 ,vi)}, {y, a{vi,vi), a{v 2 ,V 2 )}) 

is a regular (S, X, 2)-polynomial. The corresponding regular fixed-point equation can 
be written as the system 

{ Ul = X -I- 7 -I- (j{vi,V2) + (t{v2,Vi) 

V2 = y + cf{vi,vi) + a{v2,V2) . 

The least solution is the pair {Ti,T 2 ), where 

Ti = {x, 7 , a{x,y), a{j,y), a{y,x), a{y,-i), a{x,a{x,x)),... ] 


and 


T 2 = [y, o-(x, x), a( 7 , 7 ), a{y, y),... } 
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Let [li] = (Ti,... ,Tfc) be the least fixed-point for a given (S,X, fe)-polynomial 11. We 
define a binary relation ^)(n) in Ls(X); 

^)(n) = {(s, t) I s, t € Tj for some f = 1 ,..., fc} . 

Lemma 2.7.8 Ifli is a regular (S, X, k)-polynomial, then ^?(^) is a congruence ofJ-'j:{X) 
with at most k equivalence classes. For each congruence g of Fy:{X) of index k (k>l) 
there exists a regular (Jl, X, k)-polynomial IT such that ^(II) = g. 

Proof. Let IT = (Pi,..., Pk) be a regular (S, X, /c)-polynomial and [ft] = (Ti,..., T^) 
the corresponding least fixed-point. From the definition of ^>(11) it is clear that the 
relation is symmetric. To prove that it is reflexive and transitive, too, we show that 
every SX-tree t belongs to exactly one Tj. First we note that 

Ti = Pi{vi ^ Ti,... ,Vk <-Tk) {i = l,...,k) (8) 

as [n] is a fixed-point of 11. We proceed now by induction on hg(t). 

1° If hg(t) = 0, then t is in exactly one of the sets Pi {i = 1,... ,k) because 11 is 
regular. From ([8]) we see that t is in the corresponding Ti and that it could belong 
to some other Tj (j / i) only in case Vi € Pj. But hg(ui) = — 1 and Vi does not 
appear in 11. 

2° Consider a tree t = a{ti ,..., tm) {'m > 0) and assume that all trees of lesser height 
belong to exactly one Tj. Then there exists for each j = 1,... ,m exactly one ij 
{1 < ij < k) such that tj € Ti^. Also, there is exactly one i (1 < i < k) such that 
p = a{vi ^,..., € Pj. Clearly, 

t e p{vi ^Ti,...,Vk ^Tk) CTi . 

The uniqueness of the indices ij implies that p is the only tree of height 0 in Ps {X U 
Vk) from which t can be obtained by the substitutions vi ^ Ti,... ,Vk ^ Tk. Hence 
t belongs to Ti only. 

Now we know that ^)(n) € P(Ps(X)). It is obvious that it has at most k equivalence 
classes. (There may be less than k classes as some T’s could be empty.) To prove that it is 
a congruence relation we consider any m > 1, u G and si,..., SmPi-, ■ ■ ■ Pm € Ts(X) 
such that 

Si = ti, . . . , Sm — tm (^^(n)) . 

There are indices ii,... ,im such that 

Sj,tj € Ti. for j = 1,... ,m . 

Let cj(i;q,..., be in P*. Then 

Cj(si, . . . , ^(ti, ... , t^n) € Ti 
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by dSI). Hence 

cr-^^W(si, . . . ,Sm) = (^(n)) 

as required. 

Now, suppose Q G and let ■ ■ ■ ,Sk be the equivalence classes of g. We 

define a (S, X, A:)-polynoniial H = (Pi,..., P^) so that 

Pi = {p £ Fy.{X U 14) I hg(p) = 0, p{vi ^ Si,...,Vk-^ Sk)^ 5"*} 

for all i = 1,..., fc. The fact that is a congruence means that for each p of height 0 
there is exactly one i [I <i <k) such that 

p{vi ^ Si,... ,Vk Sk) C Si . 

Hence H is regular. We claim that ^>(n) = g. Let [II] = (Ti,... ,Tk). In order to 
prove the second statement of the lemma we show by induction on hg(t) that for all 
% 1 k 

(VtGPs(X))(tG5, ^tGPO . 

1° If hg(t) = 0, then there is exactly one i such that t £ Pi. This means t £ Si. 
From ([8]) it follows that t £Ti for the same i. 

2° Let t = cr{ti,..., tm) {m > 0) and suppose the claim holds for all trees of height < 
hg(t). Then there are unique indices ii,... ,im such that 

tj £ Si. nTi. {j = l,...,m) . 

Also, there is a unique i such that 

p = .. .,Vi^) £ Pi . 

Then 

t £ <T(5'q,.. .,Si^) =p{vi ^ Si,...,Vk Sk) C Si 
by the definition of Pj. On the other hand, ([5]) implies t £Ti. □ 


If we combine Lemma 12.7.81 and Theorem 12.7.11 we get 
Theorem 2.7.9 A forest is equational iff it is recognizable. □ 

From the first part of this section it is clear that a SX-forest T can be recognized by a 
fe-state tree recognizer iff T is saturated by a congruence of Ps(X) of index < k. From 
Lemma l2.7.8l we get a similar connection between the number of states and the number 
of variables in a regular fixed-point equation which defines the forest. 

There is also a very close connection between regular tree grammars and the fixed-point 
equations considered here. For example, the equations of Example 12.7.71 can be converted 
into the following set of productions in which vi and V 2 are nonterminal symbols: 

ui-)■ X, ui7, vi^a{vi,V2), vi^a{v2,vi), 

V 2 ^y, V2 ^ cr{vi,vi), V2 ^ cr{v2,V2) ■ 
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The resulting regular tree grammar generates Ti if vi is the initial symbol, and it gen¬ 
erates T 2 if V 2 is the initial symbol. 

On the other hand, every regular SX-grammar with k nonterminal symbols can be 
converted into a fixed-point system with k equations. This system is not necessarily 
regular, but the components of the least solution are nevertheless the regular forests 
generated by the grammar from the different nonterminal symbols. For example, if 
S and X are as in Example 12.7.71 and the productions are 

a —>■ X, a —>■ 7 , a^a{a,b), b^a{b,b), b^y, 

then the corresponding equations would be 

a = X y + a{a, b) and 
b = y + a{b,b) , 

where a and b now are the unknowns. The least solution is (T{Ga),T{Gb)), where 
Ga and Gb are the grammars which we obtain by choosing a and b, respectively, as the 
initial symbol. 

2.8 A MEDVEDEV-TYPE CHARACTERIZATION 

Our next description of the recognizable forests is a streamlined generalization of a well- 
known characterization of the regular languages given by J. Medvedev in 1956. First 
we define the family of representable forests. The theorem states then that the repre¬ 
sentable forests are exactly the recognizable forests. The representable forests are defined 
collectively for all ranked alphabets as the definition involves tree homomorphisms and 
these may take us from one alphabet to another. Recall that r(S) is the finite set of 
nonnegative integers m for which 7 ^ 0. 

Definition 2.8.1 For every pair (S,X) we dehne the “next-to-root function” 
nroot: F^{X) — (Sq U X) —>■ U((^ UXr I rn G r(S)) 

so that 

nroot {a {ti,... ,tm)) = (root(ti),..., root(tm)) 

for all m > 0, CT € and ti,... ,tm G F^{X). 

Definition 2.8.2 The elementary TiX-forests are the forests 

U{d) = root~^ {d) (d G F U X) , and (i) 

F(di,... ,dm) = nroot“^(di,... ,dm) , (ii) 

where m > 0, m € r(F), and di,..., dm G S U X. 


93 




2 TREE RECOGNIZERS AND RECOGNIZABLE FORESTS 


Note that the definitions of the U{d)- and V{di,..., dm)Aoiests presume a S and an X 
although the notations do not show this. Clearly, U{d) is the set of all SX-trees with 
the root labelled by d, and V{di,..., dm) consists of all EX-trees of height > 1 in which 
the nodes immediately above the root are labelled, from left to right, by di,... ,dm, 
respectively. Note also that U{d) = {d} when d € Eq U X. We need three more 
definitions. 

Definition 2.8.3 The restriction of a forest T is the forest 

rest(r) = {t € T \ sub(t) C T} . 

Definition 2.8.4 The elementary operations on forests are the formation of 

(i) the union of two forests, 

(ii) the intersection of two forests, 

(iii) an alphabetic tree homomorphic image of a forest, and 

(iv) the restriction of a forest. 

Definition 2.8.5 A forest is representable if it can be constructed from elementary 
forests by a finite number of applications of elementary operations. 

Now the theorem can be stated. 

Theorem 2.8.6 A forest is representable iff it is reeognizable. 

Proof. To prove that the representable forests are recognizable it suffices to note that 
the elementary forests are recognizable and that the elementary operations preserve 
recognizability. Consider any E and X. If d G Eq UX, then U{d) = {d} G Rec(E,X). If 
d G 'Em (m > 0), then 

U{d) = d(yi,.. .,ym)[yi ^ ...,ym<r- Fj:{X)) 


is again recognizable. Similarly, 

C(di ,..., djYi} — U( • • • ) Vni) (jjl U{dl), . . . , ym U{dm)') \ <7 G E^) 

is recognizable for all m G r(E) and di,...,dm G E U X. We have already seen in 
Section [2.41 that unions, intersections and alphabetic tree homomorphisms preserve rec¬ 
ognizability. Let T be the forest recognized by a EX-recognizer A. We construct a 
recognizer for rest(r). First define a E-algebra B = {AC b,E) (b ^ A) so that 


(T (6i, . . . , bm) 


a^{bi, ...,bm) iibi,...,bme A and cr-^( 6 i, ...,bm) € A' , 
b in all other cases, 
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for all m > 0, O' € and bi,... ,bm € A Lib. The initial assignment 13: X ^ Au b is 
defined so that for each x G X, 


x(3 = 


xa if xa G A' , 
b ii xa ^ . 


Consider any SX-tree t. It is easy to show that 


td if sub(t) C T , 
b otherwise. 


Hence, B = {B,f3,A') recognizes rest(T). 

We shall now show that every recognizable forest is representable. Let T = T{A) for 
some SX-recognizer A. First define a new ranked alphabet such that 

X (A U X)'^ for all m > 0 . 

We construct two representable HA-forests R and S as follows. For c G A U A we 
introduce the notation 

j c if c G A , 

\ ca if c G A . 


Then 


R = {x G X \ xa G A'} U 

U(^'« cr,Ci,..., Cm )) I (o-Ai , . . . , Cm ) G H, cr"^(ci, ... ,Cm) G A') 


The forest S is the union of all intersections 


V{ui , . . . , UjYi ) n [/((cr, 6i, ..., bm)) 1 

where for each i = 1,..., m, either 

(i) Ui G X and 6* = Uia, or 

(ii) Ui = (r, ci,...,Cfc) G Hfc (A: > 0) and 6* = ..., Cfc). 

Note that the possibility m = 0 is included at appropriate places in the definitions of 
R and S. 

Dehne the tree homomorphism 

h: Fn(A) ^Fs(A) 

so that 

hm bi, . . . , 6rn)) — ^ i (j^ ^ 0, (iT, 6i, . . . , 6m) G ^m) 
and hx = i-x- Clearly, h is alphabetic. We claim that 

T = h{P) 
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for the representable forest 


P = i? n rest(5 U rio U X) . 

Let p ^ P. If p = (<T, e) G Dq, then p G R implies € A'. Hence h{p) = a G T. If 
p = X G X, then p G R implies h{x)a = xa G A'. Again h{p) = x G T. Next we show 
that for every p G rest (S' U Hq U A) of height > 1 

h{p)a = cr'^(bi,... ,byn) , where (it, 6i,..., 6^) = root(p) . (1) 


We proceed by induction on hg(p). 


1° If hg(p) = 1, then m > 1 and 


p= {a,bi,...,bm){ui,...,Um) 


for some ui,..., Um G HqU A. Since p G S we have h{ui)a = bi for alH = 1,..., m. 
But this implies that ([I]) holds for p. 

2° Now let 


and assume that (P) holds for the trees pi,... ,Pm- As p is in 5 and 

h{p)a = a-^{h{pi)a,... ,h{pm)a) , 


it suffices to show that h{pi)a = bi for every i = 1,... ,m. We should consider 
three cases. 

(a) If Pi is of the form (r, ci,..., Cfc)(ri,..., r^) (fc > 0), then the induction hy¬ 
pothesis yields 

h{pi)a = ...,Ck) ■ 

Moreover, ... ,Ck) = bi = bi since p G S. 

(b) If Pi = {a, e) G Hq, then h{pi)a = ci'^ = bi = bi. 

(c) If Pi = X G A, then h{pi)a = xa = bi. 


Now we have completed the proof of (P). Consider any tree 


p= (fT,6i,...,6m)(Pl,---,Pm) € P • 
By using m and the fact that p G P we get 

h{p)a = a-^{bi, ...,bm)GA' . 


This implies h{p) G T and we have shown that h{P) C T. 

In order to prove the converse inclusion we show first by tree induction how to construct 
for each t G Ps(A) a tree p G rest(S' U Hq U A) such that h{p) = t: 
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1° If t = X G X, then we may choose p = x. 
2 ° If t = cr G Sq) put p = {a, e). 


2.9 Local forests 


3° Let t = a{ti,... ,tm) {m > 0) and suppose we have trees pi,... ,pm G rest(S' U 
Ho U X) such that h{pi) = L (z = 1,..., m). If we put 

p= , 

where bi = tiO for i = 1,..., m, then h{p) = t and p G rest(5'UHoUX) as required. 

Let t (z T and construct a p for t as above. To prove t G h{P) it suffices to show 
that p & R. This can again be done by tree induction: 

1° If t = X G X, then xa G A! and hence p = x G R. 

2° If t = (T G So, then a'^ G A' and p = {a,e) G U{a, e) C R. 

3° Let t = a{ti,... ,tm) (m > 0). If we use ([I]) and its notation, we get 

... ,bm,) = h{p)a = ta G A' . 

This shows that p G R. □ 

2.9 LOCAL FORESTS 

In this section a proper subfamily of the recognizable forests is introduced. We will then 
also get one more characterization of the recognizable forests, not quite unrelated to 
that given in the preceding section. 

We need the following auxiliary concept 

Definition 2.9.1 The set of forks fork(t) of a SX-tree t is defined as follows: 

1° If t G So U X, then fork(t) = 0. 

2° If t = a{ti ,..., tm) {m > 0), then 

fork(t) = fork(ti) U • • • U fork(tm) U {cr(root(ti),... ,root(tm))} ■ 

The set of all forks of SX-trees |J(fork(t) | t G Fs(X)) will be denoted by fork(S,X). 

Example 2.9.2 Let S = Sq U Si U S 2 , Sq = { 7 }, Si = {r}, S 2 = {cr} and X = {x, y}. 
For the SX-tree 

t = a{x, r(y))) , 

we have 

fork(t) = {o-(r,cr), r( 7 ), a{x,T), r(z/)} . 

Graphically these forks are represented as in Fig. 12.71 respectively. Obviously, fork(S,X) 
is always finite and here it consists of 30 forks. □ 
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Local forests may now be defined. 

Definition 2.9.3 A EA-forest T is local if there are sets i?(C EUA) and F(^C fork(E, A)) 
such that, for each t G Fy:{X), 

t € T iff root(t) € R and fork(t) C F . 

Then we write T = Loc(i?, F). 


Hence the membership of a EA-tree t in the local forest Loc(i?, F) can be decided by 
testing for the local properties root(t) € R and fork(t) C F. 

A EA-recognizer for Loc(i?, F) can be constructed as follows. First we define a E- 
algebra A = {A, E). Let A = EUAU0(0^EU A). For every a G Eq, put = a. 
For m > 0, a € E^ and oi,..., Om G A let 


(7 (oi, . . . , flm) 


a if fT(ai,.. .,am) G F , 
0 otherwise. 


Let a: A ^ A be the embedding x !->■ x (x G A). It is easy to show that for all 
t€Fs(A), 

froot(t) if fork(t) C F , 

ta = < 

I 0 otherwise. 

This readily implies T{A) = Loc{R, F) for A = (A, a, R). Hence we have 


Theorem 2.9.4 Every local forest is recognizable. □ 

The converse of Theorem 12.9.41 does not hold. For example, the forest consisting of 
the single tree of Example 12.9.21 is not local as there are many other trees with the same 
root and the same forks. However, the following fact can be proved. 


Theorem 2.9.5 For every recognizable EX-forest T there exist a ranked alphabet 12, a 
frontier alphabet Y, a local HE -forest S and an alphabetic tree homomorphism 

h: Fo(E) ^Fs(A) 


such that T = h{S). 
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2.10 Some basic decision problems 


Proof. Let G = (N,!!, X, P, ao) be a regular SX-grammar generating T. We assume 
that G is in normal form. A new ranked alphabet is defined so that 

^ ■ ■ ■ ) ®m,)] I 0. y , dm) € ^ 

for all m > 0. Also, let 


Y = {[a ^ x\ \ a ^ X & P, X & X} . 
The local nP-forest S = Loc(i?, T) is defined by the sets 

R = {[do p] I do ^ p € P} 


and 


F = {[a ct(oi, ... ,am)]([di ^ pi],..., [o^ Prn]) I m > 0, 
a a{ai , . . . , 0-771); ^ Pi 1 • • • 1 Pm € P} • 

Finally, define an alphabetic tree homomorphism 

h: Fn{Y)^F^{X) 


by the mappings 

hy.Y^ Fy{X), [o —>■ x] !->■ X, 

and 

hm • n^^F^{x\j 

^m)-) [d fT(ai , . . . , O777 )] ^ ) ■ ■ ■)• 

Now h{S) = T, and thereby the theorem, follows from ([1]) and ([2]): 

(1) If d =^*Q t, for some a £ N and t £ Fj:;{X), then there is a tree s £ Fq{Y) such that 
/i(s) = t, fork(s) C F and root(s) is of the form [d —>■ p]. 

(2) If s G Fn{Y) is such that fork(s) C P and root(s) = [a ^ p] for somep G Ps(A^UX), 
then a =^*q h{s). 

Part ([I]) can be proved by induction on the length of the derivation of t and ([2]) by 
tree induction on s. □ 

Note that h{S) is always recognizable when S' is a local forest and h an alphabetic tree 
homomorphism (Theorem 12.9.41 and Corollary 12. 4. 20() . 

2.10 SOME BASIC DECISION PROBLEMS 

In this section we shall show that some of the first questions one might ask about given 
tree recognizers are algorithmically decidable. To begin with, we have the emptiness 
problem: Is the forest recognized by a given tree recognizer empty? Or one may ask 
whether this forest is finite or infinite. This is the finiteness problem. Finally, we have 


99 
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the important equivalence problem: Do two given tree recognizers recognize the same 
forest? In fact, the more general inclusion problem: “T(A) C T(B)?” is shown to 
be decidable. The problems are quite easy and the proofs follow the strategy familiar 
from finite automata theory with a “pumping lemma” as the key result. We have seen in 
Section [22] that any nondeterministic frontier-to-root, or root-to-frontier, tree recognizer 
can be converted into an equivalent deterministic F-recognizer. Hence we may again 
restrict ourselves to our basic type of tree recognizers. 

We need the following special notation. Let S and X be given. Introduce a new letter ^ 
and let Tg be the set of all T,{X U ^)-trees in which ^ appears exactly once. For any 
q € T^ and p G F^{X) U T^ we denote q{^ ^ p) by p • ( 7 . Also, we define the powers q^ 
as follows: 

1° = e, 

2 ° q'^^^ = Q ■ (T (n > 0). 

Using these notations we may formulate the pumping lemma of tree recognizers as 
follows. 

Lemma 2.10.1 Let A be a k-state TX-recognizer. If t & T{A) and hg(f) > k, then 
there are trees p € Fs(A) and q,r &T^ such that 

(a) t = p ■ q ■ r, 

(b) hg(( 7 ) > 1 and 

(c) p ■ q^ ■ r ^ T{A) for a// f = 0,1, 2,... . 


Proof. Suppose t € T{A) and hg(f) > k{= |A|). Then we can write t = a{ti,... ,tm) 
{m > 0, a € Sm)- Choose some j {1 < j < m) such that hg(fj) = hg(t) — 1. Then 


t — tj ■ S\ , 

where 

Si = cr(fi,..., tj—\, tj+i, • • •, tm) G Tg • 

If hg(tj) > 0, we may decompose tj the same way. Since hg(f) > k the process can be 
repeated k times and finally we obtain a representation 

t t • * ... * S 2 * 5l , 


where t' G F^{X) and si,..., s^ G T^. Moreover, hg(si) > 1 for every i = 1,..., k. Let 
Uk-\-\_ — t , Uk — t * Sk , • • • , Ui — t • Sk ■ . . . ■ Si — t . 

There must be indices h and j,k-\-l>h>j>l, such that 


UhO( = uja . 
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Now let p = Uh, q = s/i_i ■ ... ■ Sj and r = Sj-i • ... • si (if j = 1, then r = ^). Then 
t = p ■ q ■ r and hg(q) > 1 . Also, our choice of p and q implies 


pd = {p ■ q)d 


( 1 ) 


We assume that An X = 0, and extend d to a homomorphism 

a: Xy, {X U A) —> A 

so that d\A = 1^. By Lemma 12.4.171 sa = sd whenever s € Ts(X). We verify now by 
induction on i that 

{p ■ q^)d = ip ■ q)d (2) 

for every i > 0. From © we know that this is true for i = 0. Suppose ([ 2 ]) holds for a 
given i. This assumption and ([T]) imply 

{p ■ = q{C ^ {p ■ = q{^ ^ {p ■ q)d)a 

= q[^ ■<— pd)d = {p ■ q)d . 


Using ([2]) we get for each z > 0, 

{p ■ q^ ■ r)d = ^(^ •(— (p • q’‘)d^d 
= r (p • q)d'jd 
= {p ■ q ■ r)d . 

Hence, p ■ q^ ■ r ^ T{A) for all i > 0. □ 

Theorem 2.10.2 Let A be a k-state TiX-recognizer. Then T{A) is nonempty iff it 
contains a tree of height less than k. Hence the emptiness problem of recognizable forests 
is decidable. 

Proof. Suppose T{A) is nonempty. Let t be a tree in T(A) of minimal length. If 
hg(t) > k, we apply the pumping lemma and write t = p ■ q ■ r. But then T(A) would 
contain the tree p ■ r which is properly shorter than t as hg(q) > 1. Hence hg(t) < k 
must hold. The converse part is trivial. The emptiness of T(A) can always be decided 
by going through the finite set of trees of height < k. □ 

Suppose two SA-recognizers A and B are given. Clearly, T(A) C r(B) iff T{A) — 
T(B) = 0. But r(A) — r(B) is recognized by 

C= (AxH, 7 ,A'x {B-B')) , 

where x'j = {xa,xf) for x G A. Thus the question “r(A) C T(B)?” can be answered 
by deciding whether r(C) is empty or not. The equivalence problem can similarly be 
reduced to the emptiness problem. Of course, its decidability follows also from the 
decidability of the inclusion problem. We have justified 
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Theorem 2.10.3 The inclusion problem and the equivalence problem of tree recognizers 
are decidable. □ 

Finally we consider the finiteness problem. 

Theorem 2.10.4 It is decidable whether the forest recognized by a given tree recognizer 
is finite or infinite. 

Proof. Let A be a fc-state SA-recognizer and write 

r = r(A)-{t€Fs(A) |hg(t)<A:} . 

We claim that T{A) is finite iff T = 0. Obviously the condition is sufficient since the 
set of SA-trees of height < A; is hnite. If T 0 and t G T, then hg(t) > k and we may 
apply the pumping lemma and write t = p ■ q ■ r so that 

p ■ q^ ■ r ^ T{A) for all i > 0 . 

These trees are pairwise distinct since hg(g) > 1. Hence T{A) is infinite. The forest T 
is recognizable and one can easily construct a recognizer for it. This means that the 
condition T = 0 is effectively testable. □ 

The decidability of the finiteness problem may also be deduced from the following 
corollary of the pumping lemma. The proof is an exercise. 

Lemma 2.10.5 Let A be a k-state tree recognizer. Then T{A) is infinite iff it contains 
a tree t such that 

k < hg(t) < 2k . □ 

2.11 DETERMINISTIC R-RECOGNIZERS 

In Section 12.21 it was shown that NDR-recognizers recognize exactly the family Rec, 
but that there are recognizable forests that cannot be recognized by any deterministic 
R-recognizer. The limited recognition power of DR-recognizers is due to the fact that 
they have no way of combining the information gathered from disjoint subtrees. This 
implies that a DR-recognizer will accept any tree in which every path from the root to 
the frontier appears in some tree accepted by the recognizer. It will turn out that this 
closure property characterizes the forests recognizable by DR-recognizers. Here a “path” 
contains, not only a list of the labels of the nodes traversed, but also the information 
about the directions taken at the nodes. In the later part of this section we shall consider 
the minimization of DR-recognizers. It will be shown that every DR-recognizer can be 
reduced to a canonical minimal form which is unique up to isomorphism. 

Let S be a fixed ranked alphabet. In order to avoid some troublesome technicalities, 
we shall assume that Sq = 0. We associate with S a unary ranked alphabet 

r = Fi = lj(r(a) I u G S), 

where for all u, r G S, 
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(i) r(fT) = {(Ti,..., am} if CJ G Sm (m > 1 ), and 

(ii) r((T) n r(T) = 0 if <7 7 ^ r. 

The paths in S-trees can now be defined as T-trees. 


2.11 Deterministic R-recognizers 


Definition 2.11.1 Let X be any frontier alphabet. For each x ^ X the set gx{t) of 
x-paths of a SX-tree t is defined as follows: 

1° gxix) = {x}, and g^iy) = 0 for all y ^ x, y £ X. 

2° If t = a{ti, ...,tm) (cr G Sm, m > 0), then g^it) = aiig^ih)) U • • • U am{gx{tm))- 

We extend gx to a mapping from pFj^{X) to pFr(X) in the natural way. Moreover, we 
put 

g{T) = \J{gx{T) \ x £ X) 

for each T C Fs(X). 

Label the edges of the graph representing a tree t £ Fj^{X) so that the edge (counted 
from the left) leaving a node labelled by a symbol a always gets the label o'*. Then the 
elements of gxit) {x £ X) are spelled out by the paths leading from the root to a leaf 
labelled by x when we interpret a word cJiij ... a^i^x {k > 0, ciij^,..., a^i^. G T) as the 
TX-tree .. aki^{x )...). Moreover, every such path gives an element of gx{t). 

Lemma 2.11.2 If T £ Rec(S,X), then g{T) £ Rec(r,X). 


Proof. Let G = (N, S, X, F, uq) be a regular SX-grammar in normal form generating 
T. The case T = 0 being trivial, we may assume that every Ga = (X, S, X, F, a) (a £ N) 
generates a nonempty forest. Let G' = (X, T, X, P', uq) be the regular TX-grammar, 
where 

P' = {a —)■ ai{ai) \ a —)■ cj(ai,... ,am) GP, m>0, l<f< m} U 
{a —)■ X I a —)■ x G P,x G X}. 

We claim that T{G') = g{T). This follows when we show that, for every tree 

p = criji(... cjfc4(x)...) G Pr(X) 


and every a £ N, 


P£T{G',) iff p£g{T{Ga)), 


(*) 


where G'^ = (X, T, X, P', a). 

We proceed by induction on hg(p). 

1° If hg(p) = 0, then p = x. In this case (*) obviously holds as a —>■ x is in P' iff it is 
in P. 

2 ° Suppose hg(p) > 0 and that (*) holds for all trees of lesser height. 
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If p € T{Ga), then a au^ and (J 2 i 2 {- ■ ■ c^ki^ix )...) for some € N, 

and P contains a production a —>■ ai{ai,... ,am) such that ii < m. By the inductive 
assumption there exists a tree ti^ € T{Gai_^) such that < 7212 (••• < 7 ^ 4 ( 2 ;) •••) € gxiUi)- 
Moreover, we may choose for every a tree ti € T{Gai)- Then 

t = ai{ti, ...,trn)& T{Ga) and p € gx{t) C g{T{Ga)). 

Conversely, let p € g{T{Ga))- Then p € gx{t) for some t E T{Ga)- Obviously, t is of 
the form where ii < m, and it has a derivation 

a =>G < 71(01 , . . . , dni) 

This means that P' contains the production a < 714 ^( 04 ). Moreover, tjj E T{Gai^) 

<724(■ ■ ■ ^kikix) ■■■) & gxiUi)- Hence, we get a derivation 


O <714(04) P, 


which shows that p ^T{G'a). □ 

Let g be the mapping of Definition 1 2.11. Il associated with a given frontier alphabet X. 
Then we write tx = gg~^■ It is clear that tx is a closure operation in i.e., for 

all S,TC FsiX), 

(i) SCStx, 

(ii) S FT implies Stx TTtx, and 
(hi) Stxtx = Stx- 

For any T C Fy,{X), Ttx is the closure of T, and T is said to be closed \ITtx = T. 
Now, consider an arbitrary NDR SX-recognizer A = {A,A',a). For each a E A, let 

T{A,a) = {t € Fs(X) | a E ta}. 

A state a E A is a 0-state, if T(A, a) = 0. We say that A is normalized if for all m > 0, 
a E Sm and a E A one of the following two alternatives holds: 

(1) Each component of every vector in o-^{a) is a 0-state. 

(2) No component of any vector of <7'^(a) is a 0-state. 

A normalized NDR SX-recognizer A has the following important property. Let p E 
gx{s) (x E X) for some SX-tree s such that A has a computation on s which begins 
at the root in an initial state and ends at the leaf corresponding to p in a state which 
belongs to xa. Then there exists a tree t in T{A) such that p E gxit)- Such a t can be 
built around the x-path p by completing it with trees from appropriate T(A, a)-forests. 

An NDR SX-recognizer A becomes normalized if we omit from each set cr'^(a) every 
vector which contains a 0-state. This does not change T{A) because the use of a vector 
containing a 0-state cannot lead to an accepting computation. Hence, we have 
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2.11 Deterministic R-recognizers 


Lemma 2.11.3 For every NDR-recognizer there is an equivalent normalized NDR-recog- 
nizer. □ 

We associate with each NDR SX-recognizer A a DR SA-recognizer pA = (pA, A', /3) 
defined as follows: 

(i) pA = (pA, S) is the deterministic root-to-frontier algebra such that 

crP-^(iL) = (^|J(7ri((T'^(a)) \ aeH),.. ., |J(7rm(o-^(a)) | a € H)^ 

for all H € pA, m > 0 and a € Here vr^ (1 < i < m) is the i^'^ projection. 

(ii) For each x € A, x/3 = {H G pA | H n xa A 01- 

Lemma 2.11.4 For every normalized NDR HX-recognizer A, T(pA) = T(A)rx. 

Proof. In order to prove the inclusion T(pA) C T{A)tx, we consider an arbitrary tree 
s € T'(pA) and an x-path p G fl'x(s) (x G A). We should show that p G g(T{A)). Let 

P = o'liA- • • i^kikix)) ■ ■ ■)■ By the definition of pA there are states ao,ai, ■ ■ ■ ,afc G A 

such that 

(i) oo G A' and G xa, and 

(ii) Qj G TTi.{af{aj-i)) for j = 1,..., k. 

Since A is normalized, this implies that there is a tree t G T{A) such that p G gx{t)- 
Hence p G g{T{A)). Now, let s G T{A)tx and consider any x-path 

p = auii -. .UfciAx)...) G gx{s) (x G A). 

Then p G gx(t) for some t G T{A) and there are states oq, ai,..., a*, G A such that the 
above conditions (i) and (ii) hold. But the definition of pA implies that the state of pA 
at the leaf corresponding to p includes for any tree in which p is an x-path. Hence 
pA arrives at the leaf of s corresponding to p in a state belonging to xa. This holds for 
every leaf of s and therefore s G T(pA). □ 

Corollary 2.11.5 IfT^ Rec(S, A), then Ttx G Rec(S, A). □ 

Lemmas 12. 11. 31 and l2. 11.^ also imply that every closed recognizable forest is recognized 
by a DR recognizer. But it is easy to see that T(pA) = T{A) if A is deterministic. Hence 
we may state the following result. 

Theorem 2.11.6 A recognizable forest can be recognized by a DR recognizer iff it is 
closed. □ 
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The rest of this section deals with the minimization of DR-recognizers. First two 
general remarks. When A = {A,aQ,a) is a DR SA-recognizer, then the NDR algebra 
A = {A, E) is deterministic and we may view each {a G Em, m > 0) as a mapping 

a-^: A^ A^. 

Hence we write cr^{a) = (oi,..., am) rather than cr'^(a) = {(ai,..., am)}- The second 
remark concerns normalized DR recognizers. If the DR SA-recognizer A is normalized, 
one of the following conditions holds for each pair (a, cr) G A x E: 

(1) Every component of o"^(a) is a 0-state. 

(2) No component of 0 -^( 0 ) is a 0-state. 

Of course, Lemma 12.11.31 and the construction which led to it are valid here, too, but 
we define a “standard” normalized form A* = {A*,ao,a) of A as follows: 

(i) If A has no 0-state, then put A* = A. 

(ii) If A has a 0-state, choose one of them, say d, and define then for all m > 0, 
a G Em, and a £ A, 

A*, . ( (d,... ,d) (£ A™') if a-^(a) contains a 0 -state, 

" = I A(«) otherwise. 

It is easy to prove that A* is normalized and deterministic, and that T(A*) = T(A). 
Normalized DR recognizers have also the following useful property. 

Lemma 2.11.7 Let A and B be normalized DR EX-recognizers, and let a £ A, b £ B, 
m> Q, a £ Em, cr-^(a) = (ai,...,am) and a'^{b) = {bi,...,bm)- IfT{A,a) = T{B,b), 
then r(A, a,) = r(B, bi) for all i = 1,... ,m. 

Proof. If one of the states a* (1 < i < m) is a 0-state, then all of them are. Moreover, 
T(A,a) = T(B, 6 ) does not contain any tree of the form a{ti,... ,tm)- Hence, one of 
the forests T{B,bi) (1 < i < m), and therefore every one of them, is empty. Thus 
T(A, ai) = r(B, bi) = 0 for all i = 1,..., m. 

Suppose now that T{A, ai) A 0 r(B, bi) 7 ^ 0 for all i = 1,..., m. Consider any i 
{I < i < m) and L £ T{A, ai). Choose any ti £ T{A, ai),..., G T(A, aj_i), tj+i G 
T{A,ai+i),...,tm £ T{A,am)- Then a{ti,...,tm) £ T{A,a) = T{B,b) implies U £ 
T(B,bi). By a symmetrical argument, T(B,bi) C T(A, a,) holds for every i = 1,..., m. 
Hence, T{A, ai) = r(B, bi) for every i = 1,..., m, as required. □ 

We shall now define a few algebraic concepts for DR recognizers. Let A = (A, ao,a) 
and B = (H, 6o,/3) be DR EA-recognizers. 

A homomorphism from A to B is a mapping A ^ B such that 

(i) for all m > 0, (T G Em and a £ A, a^{aip) = {aiip ,..., amT)-, where (ai,..., am) = 
a-^{a), 
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(ii) aQip = bo, and 

(iii) for every x £ X, x/3(p~^ = xa. 

If is a homomorphism from A to B, we write (/?: A —> B. If such a (/J is surjective, it 
is called an epimorphism. For an epimorphism condition (iii) implies xa^p = xf3, too. If 
there exists an epimorphism ip from A onto B, then B is an epimorphic image of A. If 
p: A —>■ B is bijective, then A and B are isomorphic, and we write A = B. 

A congruence on A is an equivalence relation g on A such that 

(i) for all m > 0, cr G and a, a' £ A, ag = a'g implies a'^{a)/g = a-^{a')/g (recall 
the notation from Section fl.lj) . and 

(ii) g saturates every set xa {x £ X). 

If ^ is a congruence on A, then the quotient recognizer determined by g is the DR 
SA-recognizer 

A/g = (A/g,aog,ag), 
where Ajg = {^Ajg, S) is defined by 

a'^/^{ag) = cy^{a)lg (u G Sm, m > 0, a € A), 

and A —>■ A/g is defined by xa^ = xa/g {x £ A). It is easy to see that A/g is 
well-defined. 

The following theorem is easily obtained by modifying the proofs of the corresponding 
facts from algebra. 

Theorem 2.11.8 Let A and B be DR YiX-recognizers. 

(a) If g is a congruence of A, then the natural mapping g^: A —>■ A/g defines an 
epimorphism of A onto A/g. 

(b) Ifp:A^B is an epimorphism, then g = pp ^ is a congruence on A, and 

A/g^B. □ 

The following fact will be needed later. 

Theorem 2.11.9 IfB is an epimorphic image of A, then T{A) = T(B). 

Proof. Let p: A ^ B be an epimorphism. We verify by tree induction that 

ta = tPp~^ and tap = t^, (*) 


for every t £ Fs(A). 

1° For t = X £ X, (*) follows directly from the fact that p is an epimorphism. 
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2° Let t = cr{ti,... ,trn) and assume that (*) holds for ti,... Rm- Suppose a € ta. 
If o"^{a) = (oi,..., am), this means that ai G tia,... ,am € tmd. Hence, aiip G 
ti/3,..., G tml^- This implies 

a^{a(p) = {ai(p,amT) G ti/3 x ■ ■ ■ x tmP- 

Hence, a(p G t/3. Suppose now that aip G tp, and let o-^{a) = (ai,... ,am)- Then 
aiif G tif3 ,..., G tmf3, which implies oi G fid,..., am G tmOi. Hence, a G ta. 
The equality ta = t$(p~^ implies taip = tfl as (p is surjective. 

Now, (*) implies that for every t G Fs(X), 

t G T(A) iff oo G ta 

iff ao(p{= bo) G taip{= t/3) 

iff t G T(B). □ 

We call two states a and a' of a DR SX-recognizer A equivalent, and we write a ~a o,' 
(or just a ~ o'), if T(A,a) = T(A,o'). Obviously, ~a is an equivalence relation on A. 
We say that A is reduced, if ~a= ^a- 

Lemma 2.11.10 If A is a normalized DR TX-recognizer, then is a congruence on 
A and A/~ zs reduced. 

Proof. First we show that ~ is a congruence relation. 

(i) Consider any m > 0, o G and a,a' G A such that a ~ o'. Let 

a-^{a) = {ai,... ,am) and cr'^{a') = {a[,... ,am). 

But a ^ a' means that T(A,a) = T(A,a'), and Lemma l2. 11.71 implies that 
T(A, Oj) = r(A, o') for all z = l,...,m. 

Hence, Oj ~ o' for all z = 1,..., m. 

(ii) If o G xa and a ~ o', for some x G X and o, o' G A, then x G T(A,o) = T{A,a') 
implies o' G xa. Hence, ~ saturates xa. 

Now we know that the quotient recognizer A/~ can be defined. It is reduced as 
(a~) ~A/~ (“^~) implies o~ = o'~ (a, a' G A) because, by Theorem 12.11.91 

r(A, o) = r(A/~, o~) = r(A/~, o'~) = r(A, o'). □ 

Let a, o' G A. We write o L o' if there exist an m > 0 and a o G such that o' 
appears in a'^{a). The reflexive, transitive closure of L is denoted by h*. If o h* o', we 
say that o' is reachable from a. The DR recognizer A is said to be connected if every 
state is reachable from the initial state. 

The connected component 

= {A'^,aQ,a^) 

of a DR SX-recognizer A is defined as follows: 
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(i) = {A^, S), where = {a ^ A \ a^'r* a} and (T'^°(a) = o-^(a) for all a € S and 
a E A'^. 

(ii) xa'^ = xad A'^ for each x ^ X. 

Clearly, the operations : A^ —)• are completely defined (m > 0,it E Sm)- 

The proof of Lemma l2.1 1.11 1 is quite straightforward and we shall omit it. 

Lemma 2.11.11 Let A be any DR TiX-recognizer. Then 

(a) is connected and deterministic, 

(b) = A iff A is connected, 

(c) r(A^) = T{A), and 

(d) if A is normalized, then so is A'^. □ 

We are now ready to present the main theorem of the minimization theory of DR 
recognizers. 

Theorem 2.11.12 Let A and B be connected, normalized DR YjX- recognizers. Then 
r(A) = r(B) iffA/^j^ ^ b/~b. 

Proof. If A/~a and B/~b are isomorphic, then 

r(A) = r(A/~A) = t(b/~b) = t(b) 

by Theorems 12.11.81 and 12.11.91 
Assume now that T(A) = r(B). We define a mapping 

(^: A/~a —^ B/~b 


by the condition that 

(a~A)¥^ = 6 ~B if r(A,a) =r(B, 6 ) (oeA, 6 eB). 

The following steps (i)-(v) show that (p is the required isomorphism. 

)p is defined for all a~A E A/ A. • Since A is connected, there exist for every 
a G A a k > 0 and states oi,.. ., E A such that 

oq b ai h 02 h • • • h Ofc = a. 

Using Lemma l2. 11. 71 one shows by induction on the smallest k (corresponding to 
the given a) that there is a 6 such that T{A, a) = T(B, 6). 

(ii) p is well-defined. If a ~a T(A, o) = r(B, 6 ) and T{A,a') = r(B, 6 ') for some 
a,a' G A and b,b' G B, then 6 ~b = 
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(iii) if is injective. Similarly as (ii). 

(iv) (f is surjective. If we exchange the roles of A and B in (i), we see that there exists 
for every 6 G B an a € A such that T(A, a) = T(B, b). 

(v) is a homomorphism. That ip preserves the operations follows from Lemma f2.ll.7l 

If a~A G xa/~A {x G X) and (a~A)7’ = b^B, then x G T(A, a) = T(B, b) implies 
6~b € x/3/~b. Likewise, {a^A)T = € x/3/~b implies a~A € xaj^A- Thus 

x/3~b 7^~^ = xa~A for every x € X. □ 


A DR recognizer A is said to be minimal if no DR recognizer with fewer states recog¬ 
nizes T{A). If A is minimal, then it is connected by Lemma l 2 .ll.lll As T(A*) = T(A) 
we may also assume that A is normalized. Then T{A) = T(A/~a) implies that A 
should be reduced, too. Conversely, if A is connected, normalized and reduced, then 
it is minimal and every normalized minimal DR recognizer is isomorphic to it (Theo- 
rem i 2 .ll.i 2 p . These facts imply that the following three steps yield for any DR recognizer 
A an equivalent minimal DR recognizer B. Moreover, this B is normalized. 

Step 1. Form A*. 

Step 2. Form A*'^. 

Step 3. Form ~ for A*'^, and put B = A*'^/~. 

It is not hard to see that these steps are effectively realizable. 

2.12 EXERCISES 

1 . Let leaf(t) denote the set of symbols which label the leaves of a given SA-tree t. 
Define leaf(f) by tree induction. 

2. (a) Define the length |t| of a EA-tree t (as a word) by tree induction. 

(b) For the sake of simplicity, let E = E 2 . Derive an upper bound for |t| in terms 
of hg(f). Give also a lower bound for |t| in terms of hg(f). 

3. Let E = Eq U E 2 , Eq = {w}, E 2 = {u}, and let A = {x,y}. Construct a CF 
grammar which generates the set Fs(A) of all EA-trees (when these are viewed as 
words). Is the set of all EA-trees still a CF language if we use the Polish notation 
for EA-terms? 

4. Let E and A be as in the previous exercise. Decide which ones of the EA-forests, 
R, S, and T are recognizable, when these are defined as follows: 

(i) t € R iff the number of ex’s in t is odd. 

(ii) f G 5 iff all paths from the root to a leaf are of the same length. 
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5. 

6 . 

7. 


9. 


10 . 

11 . 

12 . 

13. 

14. 

15. 

16. 


(iii) t G T iff no leaf labelled by y appears to the left of a leaf labelled by x. 

Let A be an NDF SA-recognizer and B an NDR SA-recognizer which are asso¬ 
ciated in the sense of Section 12.21 Prove the equality d = /3 by tree induction. 

Use regular tree grammars to prove directly that Rec(S,A) is closed under a- 
products ICorollarv 12.4.1211 . 

Let us change the definition of the forest product T{x ^ T^) (cf. Definition 12. 4. 3p 
in such a way that every occurrence of each letter x G A should be rewritten as 
the same tree tx G Tx. Then we get the new product 


T[x •(— Ta; I X G A] = {t{x ■<— fa; I X G A) \ t ^ T,tx € Tx (x G A)}. 

Is Rec(S, A) closed under this product? 

Let T be a SA-forest and let x G A. Describe the forests T -a; 0 and 0 -x T. 


Do the following laws hold for x-products? 

(a) R-x{SUT) = {R-xS)U{R-xT). 

(b) {RUS)-xT = {R-xT)[j{S-xT). 

(c) R -x {S -y T) = {R -x S) -y T. 

Let us change Definition 12.4.71 so that = T -x U T^'^ for all j > 0. 

Does the new x-iteration coincide with the original one? If not, does it preserve 
recognizability? 

Let X / y (x,7/ G A). Is it possible that (T*^)*^ ^ gQj^g SA-forest T? 

Show that the construction of the tree recognizer for the forest S~^T given in the 
proof of Theorem 12.4.101 is effective when S is recognizable (and given by a tree 
recognizer). 

Prove Lemma 12.4.171 


Prove Corollary 12.4.201 directly without using Theorems 12.4.161 and 12.4.181 


Let (f. As (A) —>■ As (A) be a homomorphism of S-algebras. Prove that if A G 
Rec(S, A), then (a) Tip G Rec(S, A) and (b) Tp~^ G Rec(S, A). 


The set of atomic SA-trees is defined as 

A(S, A) = {fT(xq,.. .,Xi^) I m > 0, u G xq, ... ,Xi^ G A}. 
For the sake of definiteness, let A = {xi,..., x^} (n > 1). Prove that 
(... (A(S, _ yxr. ^ p^^x) 


(cf. Thatcher and Wright |241I |L 
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17. 

18. 

19. 

20 . 

21 . 

22 . 


23. 


24. 

25. 

26. 

27. 

28. 
29. 


Let S = S 2 = {ct} and X = {x}. Write a regular expression for the forest of all 
SX-trees which contain an even number of a’s. 


Let S and X be as in Exercise 3. Construct a SX-recognizer for the forest repre¬ 
sented by the regular expression a{x,y) a(uj, a(uj, z))*^. 

Prove Theorem 12.6.61 

If A is a SX-recognizer and T{A) = T, then d is a homomorphism from Fi’ to 
A. Prove Lemma 12.6.21 using this observation. 

Prove Lemma 12.6.141 


In Section [2.71 we noted that one may define recognizability for subsets of algebras. 
We call T (C A) a recognizable subset of the S-algebra A = {A, S), if there exists 
a congruence 9 of finite index which saturates T. Denote by Rec A the set of all 
recognizable subsets of A. Prove the following facts: 

(a) If 5,r G RecA, then S UT, S OT, S - T e RecA. 

(b) If (/?: A —7> is a homomorphism and T G RecR, then T(p~^ G RecA. 

(Note. T G RecA does not imply Tip RecR. A counterexample where A and 
B are monoids can be found in Eilenberg’s book (Vol. A) mentioned among the 
references of Chapter [TJ) 


Let 11 = 112 = {< 7 } and X = {x,y}, and let {U,V) be the least fixed-point of the 
system 


u = x + (T{a{u,v),y) 
V = a{y,u). 


Find a regular (S, X, A:)-polynomial 11 (A: > 2) such that U and V can be rep¬ 
resented as unions of some compo nents of [II]. (For a general treatment of such 
questions see Mezei and Wright 182|.) 


Show that every local SX-forest Loc(R, F) can be represented in terms of the ele¬ 
mentary forests and the elementary operations intersection, union, and restriction. 
Note the resulting connection between the Theorems 12.8.61 and 12.9.51 


Show that the decidability of the equivalence problem of tree recognizers follows 
from the results of Section 12.61 


Prove Lemma I 2 .IU. 51 


Prove that it is decidable whether a recognizable forest can be recognized by a 
DR-recognizer. 

Are all local forests recognizable by DR-recognizers? 

Present algorithms for carrying out Steps 2 and 3 of the minimization algorithm 
for DR-recognizers which was outlined in Section 12.Ill 
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2.13 NOTES AND REFERENCES 


The observation (made about 1960) that finite automata may be defined as una ry al - 
gebras is at tribu ted to J. R. Biichi and J. B. Wright (see Mezei and Wright 182], 


Thatcher 239ll ). The generalization to tree autom ata was s uggested independently 


by Doner [blil . 66] and by Thatcher and Wright |24nl . l241 |. Many of the basic re 


suits presented in this chapter were obtained in various forms by several authors, and 
often it would be hard to establish any priorities^ _ Most of the important early con¬ 


tributions can be found i n Me zei and Wright [l82l |. E ilen berg and Wright 169 1. 


Thatcher and Wright 2^|, Doner ^|, Thatcher |238l | . Pair and 


HERE 


196C, 


Brainerd [^, Arbib and Give’on^], and Magidor and Moran [l6t 

Already in many of these papers trees were defined as terms, and this formalism is 
now very common. However, most authors use no separate frontier alphabet. Also, 
often operators may have more than one rank. The original reason for our use of frontier 
alphabets was to keep the character of the algebras independent of the number of frontier 
symbols. Another popular formalism defines a tree as a pair (D, A) consisting of a “tree 
domain” D and a labelling mapping A. Each element d of D specihes a node of the 
tree and X{d) is the label of this node. This dehnition is quite convenient for discussing 
concepts and operations which involve specific occurrences of subtrees. Tree domains 
were introduced by S. Corn in 1965 (for a reference, see Brainerd |^1. 

Deterministic and nondeterministic frontier-to-root tree recog nizer s were dehned, and 
their equivalence was es tabl ished, by Thatcher and Wright 241], Doner 66], and 


Magidor and Moran 
bin 


166l |. Root-to -fron tier tree recognizers were introduced by Ra- 


2041 1 . and Magidor and Moran [l66l ]. Magidor and Moran showed the equivalence 


of NDF and NDR recognizers, and they also studied DR recognizers. 

Regular tree grammars and the results of Section 12.31 are due to Brainerd 


40( 1 . In 


Brainerd’s grammars the form of the productions is quite general, but he shows that 
they can be reduced to, what we call, regular tree grammars. 

The Boolean closure properties of Rec(S, A) were noted in many of the early papers 
mentione d abo ve. The Kleene theorem iTheore m 12.5.811 was proved by Thatcher and 
Wright 241 ] and by Magidor and Moran |l66l |. A simplified proof was given by 
Arbib and Give’on [5|. Alphabetic tree homomorp hism s (called there projections) and 
Corollary [2320] appear in Thatcher and Wright 241 j. General tree h omomorph isms 
arose as special cases of finite-state tree transductions (see Thatcher |238l . 239| | and 


Engelfriet 7^). Tree transductions and tree homomorphisms will be considered in 
ChapterjH Fore st products (or “substitutions”) were also introduced in this context. Ito 
and Ando Il27l ] present a complete axiom system for the equality of regular expressions 


(cf. also Esik [91[). 

Minimal tree recognizers and Nerode congruences are discussed in Brainerd 
Arbib and Give’on j^, and Magidor and Moran (l66l |. 

The theory of equational forests is from Mezei and Wright 182| |. We have simpli¬ 
fied the exposition by considering only regular fixed-point equations. Mezei and Wright 
considered also equational and recognizable subsets of general algebras (cf. Exercise 22). 
They proved that the equational subsets of an algebra (of hnite type) are the homo- 
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morphic images of the recognizable subsets of term algebras. Applied to term algebras 
this result gives our Theorem 12.7.91 Eilenberg and Wright present these re¬ 
sults in a category theo retic form. For va riou s classes of sub sets in general al gebr as we 
refer also to W agner t249l| . Lesganne |l5nl |. Marchand [17,^ . Shepard [2201], and 
Steinby 227l |. Dubinsky 67(| discu sses equational and recognizable subsets of nonde- 
terministic algebras. Maibaum 170], and Engelfriet and Sghmidt 85] extend the 
subject into another direction by considering many-sorted algebras. 

The material of Section [21^1 is from Costich 5^. Local forests, or similar concepts, and 
resu lts related to Theo remsl2.9.4l and l2.9.5l can be found in Doner 66 ], Thatcher 237, 
238l |. and Takahashi 23^. 


The characterization of the forests recognizable by DR recognizers is from ViR AGH 2481 ] , 
although th e ba sic idea is discernible already in Magidor and Moran 166l ] (cf. also 
Thatcher 239| | 1. The minimization theory of DR recognizers appears in Gecseg and 


Steinby [l04i | 


We should also mention an alternative approach, originating with Pair and Quere 1961 ] 
and popular among French writers, in which the basic objects are tuples of trees rather 
than trees. The usual tree operations are then augmented by operations which catenate 
tuples of trees or form a tree from an m-tuple by creating a new root labelled by an 
m-ary operator. As an abstract framework for their study Pair and Quere introduced 
“binoids”, the tuples of trees form such a binoid. Their results include the basic clo¬ 
sure properties and a Kleene Theorem. This formalism has been developed further by 
Arnold and Dauchet 2l| to a th eory of “magmoids” which also embodies many of the 
ideas of Eilenberg and Wright 69 j. Arnold 0,0 discusses many topics relevant 
to this chapter within the framework of magmoids. 

We shall now discuss briefly some topics and applications of the theory not covered 
by this book. The survey is by no means complete, and in many cases the choices 
were dictated by personal preference. Some more remarks will be made at the end of 
Chapters El and HI 

The category theoretic treatment of recognizable and equational subsets by Eilenberg 
and Wright was already mentioned. It is based o n La wvere’s “theories”. This 
approach was developed further by Give’on and Arbib llll j. and others. The theory 
of magmoids has also evolved from the same ideas. We have avoided the use of category 
theory altogether, but the bibliography contains a sample from the extensive and highly 
diversified literature on the subject. The items of inter est in clude Alagic 0 , [ 1 , Arbi b 



Bo brow and Arbib 
10, and Trnkova and Adamek [20. 


Goguen jia, Goguen et al \ll4 . Ill5l |. 


and Manes 
Horvath 

The structure theory of tree automata has receive d littl e attenti on a lthough some initial 
steps were taken already by Magidor and Moran 166]. Rigci 209| | considered cascade 
products of tree automa ta. I terative realizations and general products of tr ee automata 
are studied in Steinby |22f)| |. Two sections of Gecseg and Steinby |in5l | are devoted 
to the subject. It is evident that generalizations from the unary case will usually not be 
easy in this area. 

Transition monoids have proved very useful in finite automaton theory and some 
equivalents of them for tree automata have been suggested. The “m-ary monoids” of 
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Give’on 11^ and the “substitution algebras” of Yeh [253| | are in fact spe cial Menger 
algebras. The same idea reappears in the “clone algebras” of Turner |246l |. Sommer- 


HALDER 


223l | develops the concept further and associates with an algebra a 


sequence 


Ml, M2 ,... of monoids. Here M„ consists of all n-tuples of n-ary polynomial functions of 
the algebra. It would be easy to define syntactic monoids of forests along these lines, but 
no such theory seems to have evolved yet. Another variant of the transition semigroup 
concept has been studied by Helton [l2ril |. 

We shall mention some other algebraic topics of potential interest. A SY-forest T is 
said to be recognizable by a S-algebra A = {A, S) if one may choose a: X ^ A and A'(C 
A) in such a way that {A, S, A') recognizes T. Families of forests recognizabl e by a lgebras 
belonging to a given vari ety ( equational class) were considered by Steinby 224l | and by 
Gecseg and Horvath |i 03 |. For a further study in this direction it would probably 
be advantageous to follow the example of Eilenberg’s theory of M-varieties and varieties 
of recognizable languages and consider “u)-varieties” (usually called pseudovarieties) of 
algebras and the families of forests corresponding to them; an w-variety is a class of 
finite algebras closed under the cons truct ion of subalgebras, homomorphic images and 
finite direct products. In Steinby 226l | it was shown that Eilenberg’s basic variety 
theorem can be extended to w-varieties and varieties of recognizable subsets of free 
algebras (suitably defined). A specialization of this result to term algebras gives a 
correspondence between w-varieties and varieties of recognizable forests. A SY-forest T 
is said to be rationally represented by an HY-recognizer A if there exists an embedding 
ip: Fs(Y) —>■ Fb(Y) of a certain kind such that Tip = T{A). A variety Y of algebras is 
said to be rationally complete if every recognizable forest can be ratio nally represented by 
a recognizer based on a finite algebra belonging to Y. Gecseg |inil | studies the rational 
completeness of varieties and the equivalence of tree recogn izers with respect to rat ional 
representation. Eurther results can be found in Maroti 176|, and Marchand fl^ 
also contains some related ideas. 

We shall now list a few references to some more topics. Proba bilis t ic tr ee automata and 
related topi cs haye b een discussed by Magidor and Moran 166,_167|, Ellis 
Karpinski Ml . 142l |. Eorests of infinite trees appear in Rabin 2oJengelfriei 
Gasteran ^ and Gourcelle An alternative way to generate forests i s pr o vide d 
by th e tre e adjunct grammars s tudie d by JOSHI, Levy and Takahashi O, H, 


and 


Levy |l55l |. and Levy and Joshi 


have been c onsid ered; see C ulik 
Karpinski 


15711 . Also Lindenmayer systems (L-systems) for trees 


143], Steyart [ 23 ^, and Szilard 231 ] 


56], Culik an d M aibaum 5^, Engelfriet 7^. iTij. 


Although we present our subject as a part of pure automata and formal language 
theory, it should be clear that it has many connections to the more applied aspects of 
language specification, translation and semantics. As a conclusion we would like to point 
out some less obvious areas of application. 

When Doner 


65l . I6 (tI | and Thatcher and Wright [2401 . l24ll | introduced tree au¬ 


tomata their goal was to prove the decidability of the weak second or der theor y of 
multiple successors. Eurther applications to logic can be found in Rabin 


204 1205]. 


In syntactic pattern recognition patterns are decomposed into simple basic elements 
which are represented by letters of an alphabet. A pattern is then represented, for 
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example, as a word. However, essential information about the relations between the 
basic elements may be lost if the corresponding letters are simply concatenated to form 
a word. It is possible that these can be described adequately by representing the pattern 
as a tree, and then tree automata theory may be used. For example, the considered class 
of patterns may be generated by a tree grammar or recognized by a tree recognizer. One 
specific problem prompted by syntactic pattern recognition is the inference of forests 
from samples. T he in terested reader may consult the books by Fu [971] and Gonzalez 
and Thomason 117 |. Some papers from this area are Berger and Pair Bra yer 


and Fu 
and Fu 


Fu and Bh arga va l98ll . Gonzalez, E dwa rds and Thomason [llfil j. Lu 


Pair [l94l |. Tai j232l |. and Williams |251 | 
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3 CONTEXT-FREE LANGUAGES AND 
TREE RECOGNIZERS 


The words generated by a context-free grammar can be read from derivation trees. The 
connection between forests and languages implied by this fact is the subject matter 
of this chapter. In the first section we define the yield-function by means of which a 
word is extracted from a tree. In Section 13.21 the basic relations between recognizable 
forests and context-free grammars are established. The usual definition of derivation 
trees must be modified slightly as to make them “trees” in our sense of the term, but 
the difference is inessential. The forest of derivation trees of any CF grammar is shown 
to be recognizable. On the other hand, we shall see that the yield of any recognizable 
forest is a CF language. Hence tree recognizers may also be viewed as recognizers of CF 
languages. The section is concluded by showing that every CF language is the yield of 
a local forest recognizable by a deterministic R-recognizer. 

The inverse image of a CF language under the yield-function is not always a recog¬ 
nizable forest, but we show in the beginning of Section [3.31 that the inverse image of a 
regular language is a recognizable forest. Also, a slightly restricted converse of this fact 
is presented. Then we show that every CF language can be obtained from a recognizable 
forest with a fixed and very simple ranked alphabet. Section (3.31 is concluded by some 
examples which show how facts about context-free languages can be proved using the 
theory of recognizable forests. 

In Section [3.41 another, less well-known, way to obtain the context-free languages from 
recognizable forests is presented. 

3.1 THE YIELD FUNCTION 

We shall now formally define the function that extracts a word from the frontier of a 
tree. This will also give a function that associates a language with every forest. 

Definition 3.1.1 The yield yd{t) of a SA-tree t is defined inductively as follows: 

1° yd(x) = a: for all X G A. 

2° If t = a{ti, (m > 0, cj G S^), then yd(t) = yd(ti). ..yd{tm)- 

The yield of a SA-forest T is the A-language yd(r) = {yd(t) | t G T}. 

To obtain the yield of a tree cT(ti,..., tm) one concatenates the yields of the subtrees 
In particular, yd(cr) = e for all a G Sq. More generally, yd{t) = e iff 
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t € -Fs(0)- The mapping 

yd: Fs(X) ^ X* 

is not injective; in general, a word is the yield of several trees. 

We use the same symbol yd for its extension to forests. Of course, yd presupposes a 
S and an X although our notation does not show this. 

Example 3.1.2 Let io G Sq, cr G S 3 , and x,y G X. For s = a{x, a{y,u},y),Lo) and 
t = a(u;, X, a{y, y, io)) we have yd(s) = yd(t) = xyy. □ 

Whether or not a given word w G X* is the yield of some SX-tree depends on the 
length of w and the arities of the operators in S. 

Lemma 3.1.3 Let r(S) = {mi,... ,mfc}. For a word w G X* there exists a tree t G 
Fs(X) such that yd{t) = w iff the length of w can be expressed in the form 

|r/;| = hi{mi — 1) + • • • + hk{mk — 1) + 1 

for some (integers) hi,..., /ifc > 0 . □ 

The proof of the lemma is an exercise. It is easy to see that yd(Fs(X)) = X* iff 
So / 0 and S — (Si U Sq) 7 ^ 0. When this is the case, there exists for every X-language 
L a SX-forest T such that yd(T) = L. The greatest among these is the forest 

yd“^(L) = {tG Fs(X) | yd(t) G L}. 

In general, we know just that yd(yd“^(L)) C L. From Lemma 13.1.31 one easily gets 

Corollary 3.1.4 For a given L C X*, there exists a forest T C Fs(X) such that 
yd(r) = L iff 

{Iwl \wGL}C {hi(mi - 1 ) H- 1 - hk{mk - 1 ) + 1 | hi,..., hfc > 0 }, 

where {nil,... ,mk} = r{Ti). 

In the following lemma we list some obvious properties of yd and yd~^. 

Lemma 3.1.5 Let S and T he EX-forests, and K and L X-languages. Then 
(a) yd(SUT)=yd(5)Uyd(r), 

(h) yd(SnT)Cyd(5)nyd(r), 

(c) yd-\K U L) = yd-^i^) U yd-^(L), 

(d) yd“^(X n L) = yd“^(X) n yd“^(L), and 

(e) yd-\K -L)= yd-\K) - yd-\L). □ 
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3.2 Context-free languages and recognizable forests 


3.2 CONTEXT-FREE LANGUAGES AND RECOGNIZABLE 
FORESTS 

In the customary definition of derivation trees the inner nodes are labelled by nonterminal 
symbols and a nonterminal may appear at nodes with different numbers of outgoing 
edges. Since we allowed a symbol of a ranked alphabet to have just one rank, the 
definition of derivation trees should be modified accordingly. 

Let G = {N,X,P,ao) be a CF grammar as defined in Section [L6l We associate with 
G a ranked alphabet thus: for each m > 0, 

= {(a,m) \{3a^r]€ P)\r]\ = m}. 

Definition 3.2.1 Let G and be as above. For every d £ N L) X the set D(G, d) of 
derivation trees with d as the root is defined by the following conditions: 

1° D{G,x) = {x} for each x £ X. 

2° For a £ N, (a, 0) £ D{G, a) iff a —>■ e € P. 

3° Suppose a ^ di .. .dm £ P, with m > 0, a £ N and di,... ,dm £ X U X. If 
h £ D{G,di),... ,tm & D{G,dm), then (a,m)(ti, ... ,tm) € D{G,a). 

4° Nothing is in any D{G,d) unless this follows from a finite number of applications 
of the rules 1°, 2° and 3°. 

The derivation forest of G is the S'^X-forest D{G) = D{G,ao). 

Exactly as in the case of conventional derivation trees, every t in D{G, d) (d £ N UX) 
corresponds to a unique leftmost derivation in G of the word yd(t) from d. Also, every 
derivation 

d =^G ui =^G ■ ■ ■ =>G Uk-l =^G w, 

with d £ N [J X and w £ X*, can be described by a tree t £ D{G,d) such that 
yd(t) = w. This is easily shown by induction on the length of the derivation. Hence, 
L{G)=yd{D{G)). 

Theorem 3.2.2 The derivation forests of CF grammars are local and, therefore, recog¬ 
nizable. 

Proof. Let G = {N,X,P,ao) be a CF grammar. It is obvious that D{G) is the local 
S*^X-forest L{R,F) (in the notation of Section [2.91) . where 

R = {(oo, m) I m > 0, (oq, m) G 

and the set F of the allowed forks is defined as follows. If m > 0 and a ^ di .. .dm £ P, 
then we include in F every fork (a,m)(ci,..., Cm) such that for alH = I,...,m, 

^ r d, iidi£x, 

\ [di, k) with A: > 0 and {di, k) £ if di £ N. 
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Nothing is in F unless this follows from the construction described above. □ 

It is also easy to see that D{G) is generated by the regular S'^X-grammar Gd = 
(N, X, Pd, oo), where 

Pd = {a —?> (a, m){di,..., dm) | m > 0, a ^ di... dm ^ P, di,. ■ ■ ,dm & N U X}. 

Example 3.2.3 Consider the CF grammar 

G = ({ao, b}, {x, y}, {uq ^ xaob, oq ^ e, 6 xyb, b y}, ao). 

In this case U U where = {(ao,0)}, = {(6,1)} and = 

{(ao, 3), (6, 3)}. The productions of the grammar Gd = {N,E^, X, PD,ao) generating 
D{G) are qq —>■ (ao, 3)(x, oq, 6), oq —>■ (ao,0), 6 —>■ (6,3)(x,y,6) and 6 —(6, l)(y). The 
allowed roots of the local forest D{G) are (ao,0) and (ao,3), and the possible forks are 
(ao,3)(x, (ao,0),(6,1)), (ao, 3)(x, (ao, 0), (6, 3)), (ao, 3)(x, (ao, 3), (6,1)), (ao, 3)(x, (ao, 3), 
(6, 3)), (6, 3)(x, y, (6,1)), (6, 3)(x, y, (6,3)) and (6, l){y). □ 

Theorem 13.2.21 yields immediately 

Corollary 3.2.4 Every CF language is the yield of a recognizable forest. □ 

The converse is also true: 

Theorem 3.2.5 The yield of any recognizable forest is a context-free language. 

Proof. Let G = {N,T,,X,P,ao) be a regular SX-grammar generating the given recog¬ 
nizable SX-forest T. To simplify matters we assume that G is in normal form. Now we 
construct the CF grammar Gi = (X,X,Pi,ao) with 

Pi = {a —)• yd'(p) I a —> p € P}. 

Here yd^ is the yield-function corresponding to the extended frontier alphabet X U X. 
Inductions on the lengths of the derivations show that 

(1) a =>Q t implies a yd(t), for all a € X, t € Ps(X), and that 

(2) for all w € X* and a & N, a ^ ™ there exists a tree t € Ps(X) 

such that a t and yd(t) = w. 

These two facts imply that yd(r) = L{Gi) is CF. □ 

In view of Theorem l3.2.5l anv tree recognizer may be seen as a device which recognizes 
a CF language by checking the possible syntaxes of given words; a word is accepted iff 
it is the yield of at least one tree accepted by the tree recognizer. 

Definition 3.2.6 The language recognized by a SX-recognizer A is the X-language 
P(A)=yd(r(A)). 
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3.2 Context-free languages and recognizable forests 


The previous results can now be expressed as follows. 

Theorem 3.2.7 A language is recognized by a tree recognizer iff it is context-free. □ 

The equivalence expressed in Theoreni l3.2.7l is effective both ways; for any CF language 
given by a CF grammar we can construct a tree recognizer, and for any tree recognizer 
A we can construct a CF grammar generating L{A). 

By Theorem 13.2.21 every CF language is the yield of a local forest. We shall now show 
that even a smaller class of forests will suffice. To this end we replace derivation trees 
by trees in which the inner nodes are labelled by complete productions. 

With every CF grammar G = {N, X, P,ao) we associate another ranked alphabet 
defined as follows. For each m > 0, let 

= {(® —>■ ??) I a —^ is in P and \7]\ = m}, 

i.e., the m-ary symbols correspond to the productions with right-hand sides of length 

m. 

Definition 3.2.8 Let G and be as above. For every d € A U A the set P{G,d) of 
production trees with d at the root is defined by the following conditions: 

1° P{G, x) = {x} for each x € X. 

2° For a G N, {a ^ e) G P{G, a) iff a —> e € P. 

3° Suppose a ^ di... dm & P (m > 0, a N and di,... ,dm S A U X). If pi E 
P(G, di),... ,pm e P{G,dm), then (o di . ..dm){pi, ■ ■ ■ ,Pm) G P{G,a). 

4° Nothing is in any P{G,d) unless this follows from a finite number of applications 
of 1°, 2° and 3°. 

The production forest of G is the S^A-forest P{G) = P(G, oq). 

In our previous discussion of DR-recognizers we excluded nullary symbols, but since 
the ranked alphabets may contain such symbols, we now extend the definition of a 
DR SA-recognizer A = (A, oq, A') by setting cr-^ E A and ad = {a-^} for any a E Sq. 

Theorem 3.2.9 The production forest P{G) of any CF grammar G is local and it is 
also recognizable by a deterministic R-recognizer. 

Proof. Let G = (A, A, P, oq) be a CF grammar. The presentation of P{G) as a local for¬ 
est is similar to that of D{G). We construct a DR S-^A-recognizer A = (A, A, A', a) 
as follows. Put A = A U A U {d} (d ^ A U A), A' = {ao}, and for each x E A, 
xa = {x}. Next, the underlying root-to-frontier algebra A = (A, is defined. If 
a = {a ^ e) G then = o. Let cr = (a —>■ ci... Cm) G with m > 0. Then we 
put a-^{a) = (ci,..., Cm), and a^{b) = (d,..., d) for all b a. It is easy to show by tree 
induction that for all t E Fy.p{X) and a E A U A, 

a E td iff t&F{G,a). 
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This implies that A recognizes P{G). □ 

The language recognized by an R-recognizer is defined in the natural way. As it is 
obvious that yd(P(G)) = L{G) for every CF grammar G, we may state 

Corollary 3.2.10 Every CF language is recognized by a deterministic R-recognizer. □ 

3.3 FURTHER RESULTS AND APPLICATIONS 

Every CF language L is the yield of many different forests. Such a forest is not neces¬ 
sarily recognizable. In particular, the greatest of them (for a given S) yd~^(L) may be 
nonrecognizable. 

Example 3.3.1 Let S = S 2 = {o'} and X = {x,y}. Consider the (minimal linear) CF 
language L = | > 1}. If yd~^(L) were recognized by a SA-recognizer A, then 

A would accept all trees a{si, ti) {i > 1), where (i) si = x, ti = y and (ii) Sj+i = (j{sk, x) 
and tk+i = a{y,tk) for all A: > 1. As A is finite, it would then also accept some tree 
a{si,tj) with i / j. But this is a contradiction, because yd(o'(si, tj)) = x'^y^ ^ L. □ 

In contrast to Example 13.3.11 we have 

Theorem 3.3.2 If L is a regular X-language, theny(I~^{L) € Rec(S, A) for any ranked 
alphabet S. 

Proof. Let M be a finite monoid, tp : X* ^ M a homomorphism and H a subset of M 
such that L = Let A = (M, S) be the S-algebra defined so that 

o''^(ai,..., Om) = ai ■ 02 ■■■■■ am (product in M) 

for all m > 0, O' E T,m and oi,..., E M. In particular, = 1 when a E Sq. If we 
put 

a = p>\X: X ^ M, 

then 

ta = yd(t)(/9 for all t E Fs(A). 

This implies that yd“^(L) = T(A) for the SA-recognizer A = {A,a,H). Indeed, for all 
t€Fs(A), 

t E T(A) iff ta = yd{t)(p E H 
iff yd(t) E L 
iff t E yd“^(L). 

□ 

The full converse of Theorem 13.3.21 is not valid, but the following result will be proven 
in Exercises 6 and 7. 
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3.3 Further results and applications 


Theorem 3.3.3 Let L (C X*) be a language and S a ranked alphabet such that 
yd(yd“^(L)) = L. Then yd“^(L) G Rec(S,X) implies L G RecX. □ 

The ranked alphabets and depend on the given CF grammar. We shall now 
show that every CF language is the yield of a recognizable forest over a fixed ranked 
alphabet. In fact, a very simple alphabet will suffice. 

Theorem 3.3.4 Let T, be a ranked alphabet which contains a binary operator and a 
nullary operator. Then every CF language is recognized by a T,-recognizer. For e-free 
CF languages the binary symbol alone is sufficient. 

Proof. Let us consider the e-free case first. Every CF language L C is generated 
by a CF grammar G = {N,X,P,ao) in Chomsky normal form, where each production 
is of the form a ^ be or a ^ x {a, b,c d N, x G X). By Lemma 12.4.11 we may assume 
that S = S 2 = {ct}. Let Gi = {N,T,, X, Pi, uq) be the regular SX-grammar, where 

Pi = {a ^ a{b, c) I a —>■ 6c G P} U{a^x|a—7>xG P}. 

Adjoin N to the frontier alphabet and let 

yd': Ps(XUX) ^ (X U iV)* 

be the corresponding yield-function. By induction on the length of the derivation one 
can verify that for every derivation 


a =^G ui =^G ■ ■ ■ =^G Uk {a e N,k> 1) 


there is a derivation 

a Pi =^Gi ■ ■ ■ =>Gi Pk (Pi, ■ ■ ■ € Pe(X U N)) (*) 

such that yd'{pi) = Ui for i = 1,... ,k. This implies L{G) C yd(r(Gi)) as yd'|Ps(X) = 
yd. The converse inclusion follows from the fact that for every derivation (*) we have a 
derivation 

a ^G yd'(pi) ^G yd'ipk)- 

If P C X* and e (z L, then we find, as above, a recognizable SX-forest T such that 
yd(T) = L — {e}. Now add a nullary operator cj to S and let T' = T U ui. Then T' is 
recognizable and yd(r') = L. □ 

The connections established above suggest the possibility of developing, or just inter¬ 
preting, the theory of context-free languages in terms of tree automata and recognizable 
forests. We shall illustrate this by a few examples. The results themselves are well 
known. 

Theorem 3.3.5 The intersection of a context-free language with a regular language is 
context-free. 
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Proof. Consider a CF language L C X* and a regular language U over the same 
alphabet. Choose any ranked alphabet S and a recognizable SX-forest R such that 
yd(ii) = L. Then 

LnC/ = yd(i?nyd"^(17)). 

Since i?nyd“^(17) € Rec(S,X) by Theorem 13.3.21 and Theorem 12.4.21 this means that 
L nU is context-free. □ 

The next example shows how the regular forest operations relate to language opera¬ 
tions. 

Definition 3.3.6 Let U and V be X-languages and x X. The x-substitution of U 
into V is the language U -xV of all words 


W 0 U 1 W 1 U 2 ■ ■ ■ Wk-lUkWk, 

where k > 0, ui,... ,Uk € U, wqxwix ... xwk-ixwk G V and x does not appear in the 
word wqWi .. -Wk- 

The X-substitution closure of U is the language 

U*^ = I i > 0), 

where = {x} and -x U U for i > 0. 

Consider two SX-forests S and T and a symbol x € X. Every tree p G S -x T is 
obtained from some tree t € T by replacing each occurrence of x by some tree from S. 
Suppose X appears k times {k > 0) in t and that we get p by replacing these occurrences, 
from left to right, by the trees si,..., Sk € S. If 

yd(t) = wqxwix ... xwk, 


then 

yd(p) = t(;oyd(si)u;iyd(s 2 ) ■ ■ ■ yd{sk)wk G yd(5) -a, yd(r). 
Conversely, if u; G yd(S') -x yd{T), then we may write w in the form 


W = W0U1W1U2 . . . Wk-lUkWk 

so that A: > 0, woxwix ... xwk G yd(T) and rti,..., G yd(S'). Then there are trees 
t £ T and si,...,Sfc G S such that yd(t) = wqxwix ... xwk and yd(si) = ui, ..., 
yd(sfc) = Uk- If we replace the occurrences of x in t by the trees si,..., s^, then we get 
a tree p £ S -xT such that yd(p) = w. An easy induction on i shows now that 

yd(r’*) = yd(r)*’^ for all i > 0. 

Using these observations we get 

Lemma 3.3.7 For any two EX-forests S and T, and any letter x £ X, 
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3.4 Another way to recognize CF languages 


(a) yd(S' -a; T) = yd(S’) -a; yd(T) 
and 

{h) Yd{T*^) = yd{T)*\ □ 

Now we can derive the following well-known description of the family of context-free 
languages. 

Theorem 3.3.8 The context-free languages form the smallest family of languages which 
contains the finite languages and is closed under (finite) union, x-substitutions and x- 
substitution closures. 

Proof. Clearly, all finite languages are context-free. Let U,V C X* be CF and x & X. 
There exist recognizable forests S,T C Fj^{X) such that yd(5) = U, yd(T) = V. Now 
UUV = yd(5UT), U-xV = yd{S-xT) and V*^ = yd(T*^) are all seen to be context-free. 
On the other hand, the Kleene theorem (Theorem 12.5.81) together with Corollary 13.2.41 
and Lemma 13.3.71 shows that every CF language can be obtained from finite languages 
by forming unions, x-substitutions and x-substitution closures. □ 

Note that when a CF X-language is expressed in terms of finite languages, unions, 
substitutions and substitution closures, symbols not in X may be used as auxiliary 
symbols in substitutions. 

As an example we consider the language L = | re > 0}. Let w E Sq and cr E S 3 . 

Then L is the yield of, for example, the recognizable SX-forest 

T = {w, C 7 (x, w, y), cr(x, ct(x, w, y), y),...} 

which has the regular expression uj a{x, z,y)*^. From this we get for L the represen¬ 
tation 

L = {e} {xzyr- 

Here z is an auxiliary letter which does not appear in the language represented. 


3.4 ANOTHER WAY TO RECOGNIZE CF LANGUAGES 


If an ordinary finite automaton is viewed as a unary algebra, then its input symbols form 
a ranked alphabet. There is a way to interpret SX-trees as words over S in the general 
case, too. When this is done, recognizable forests become CF languages. Moreover, 
every CF language can be obtained this way as a recognizable forest once its alphabet 
is suitably ranked. 

We consider the unary case as an introduction. The word 

try = cJi... CTfc E S* 

can be obtained from the corresponding S{x}-tree 

t = (Tfc(. . .cri(x)...) 


recursively as follows: 
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1° XT] = e for all x € X. 

2° trj = srja if f = (t(s) {a € S). 

Another way to get trj would be to erase the parentheses and x and then reverse the 
resulting word. Both of these constructions can serve as a basis for the generalization 
to the case of an arbitrary ranked alphabet. The reversing of the order of the word 
is an inessential step due to our way of writing trees, and it will be omitted in the 
generalization. 

Let S be an arbitrary ranked alphabet and X any frontier alphabet. We shall treat S 
as an ordinary alphabet, too. We assume that S and X are disjoint and that they do 
not contain (, ) or the comma. Let 

y = suxu{(,),,} 


and define 

r]:Y* 

as the monoid homomorphism such that 


yri 


y for y e S, 
e for y £ Y — Ti. 


Applied to a SX-tree t rj erases all frontier letters x £ X, the parentheses and the 
commas leaving the symbols u G S intact. It is easy to see that this can be carried out 
as follows, too. 


Lemma 3.4.1 The words trj (t £ Fy,{X)) can he found recursively as follows: 

1° XT] = e for all X £ X. 

2° If t = cr{ti ,..., tm) (m>f), a £ Em), then trj = atirj... t^rj- □ 

We have already noted that every regular SX-grammar may also be viewed as a 
CF grammar generating a X-language. Moreover, it is well-known that the family of 
context-free languages is closed under homomorphisms. Hence we have 


Lemma 3.4.2 // T G Rec(S,X), then Trj £ CF(S). □ 

Next we prove the following converse of Lemma 13.4.21 


Lemma 3.4.3 Let S and X be alphabets. If E is ranked so that E 2 = E, then there 
exists for each CF language L YE* a recognizable EX-forest T such that Trj = L. 


126 




3.5 Exercises 


Proof. First, let L be e-free. Then L is generated by a CF grammar G = (A^, S,P, oq) 
in Greibach 2-form, where each production is of the form (i) a —)■ abc, (ii) a —)• cr6 or 
(iii) a ^ a (a,b,c G N, a £ S). We convert G into a regular SX-grammar Gi = 
(N,'E, X, Pi,ao), where the set Pi of productions is defined as follows. Fix any x G X 
and put then 

Pi = {a ^ a{b, c) I a —)• abc G P} U {a —>■ a{h, x) \ a ^ ab G P} U 

{a — a{x, x) \ a ^ a G P}. 

In order to show that T(Gi) is the required recognizable forest we extend r/ to a homo¬ 
morphism 

? 7 i|(y uiv)* ^ (suiv)* 

so that r]i\Y = rj and r/ilA" = Ijv- It is easy to see that to every derivation 

a =^G ui =^G ■ ■ ■ =^G Uk {a G N,k> 1 ) 
there corresponds a derivation 


a ^Gi Vi ^Gi ■ ■ ■ =^Gi Vk (*) 

such that ViTji = ut {i = 1,..., fc). Conversely, every derivation (*) is matched by the 
derivation 

a =^G virji =^G ■ ■ ■ =^G Vkrji- 

Since r]i\Y* = r], this implies T{Gi)rj = L{G) = L. li e G L, we apply this construction 
to P — e and add then the tree x to T{Gi). □ 

In the representation of Lemma 13.4.31 the frontier alphabet X can be fixed in advance 
independently of S and the language L. A one-element alphabet X = {x} always suffices. 

We say that a SX-recognizer A rj-accepts a word tc G S*, if it accepts at least one 
SX-tree t such that trj = w. The S-language r/(A) r]-recognized by A is the set of all 
words ry-accepted by A. In this terminology the previous results may be summed up as 
follows. 

Theorem 3.4.4 A language is rj-reeognized by some tree reeognizer iff it is a context-free 
language. □ 

3.5 EXERCISES 

1. Is is possible that yd~^{w) is inhnite for some word w? 

2. Prove Lemma [3.1.31 

3. Find an example of a nonrecognizable forest T such that yd(r) is a recognizable 
language. 

4. Show that for every CF grammar G, D{G) is the image of P{G) under an alphabetic 
tree homomorphism. 
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5. Recall that a groupoid is an algebra with one binary operation (and no other 
operations). For S = S 2 = {cr}, Fy:{X) is the free groupoid generated by X. 
Verify that yd: Fs(X) —)• X~^ is a groupoid epimorphism. Then prove that a 
language L C X~^ is context-free iff it is the homomorphic image of a recognizable 
subset of the free groupoid generated by X (cf. Exercise 2.22, and Mezei and 
Wright |l82| |b 


6. The set Comb(S,V) of “comb-like” SX-trees is defined as the smallest set S 
satisfying the conditions 1° and 2°: 

1 ° XLSEqCS. 

2° If m > 0, (T G Em, xi,..., Xm-i G X and t G S, then a{xi ,..., Xm-i,t) G S. 

(a) Prove that Comb(S,X) G Rec(S,X). 

(b) Let T be a recognizable forest such that T C Comb(S,X). 

Show that T is generated by a regular SX-grammar {N,E,X,P,ao) in 
which each production has the form a —>■ cj(xi, ... ,Xm-i,b), a —>■ a; or 
a ^ X {a,b G N, m > 0, a G Em, xi,..., Xm-i G X, oj G Eq, x G X). 

(c) Infer from (b) that yd(T) G RecX for every recognizable T C Comb(S, X). 

(d) Prove that for every SX-tree t there exists a comb-like SX-tree s such 
that yd(s) = yd(t). Deduce from this fact that if yd(yd“^(L)) = L for 
some L C X*, then 

yd(yd“^(L) n Comb(S, X)) = L. 


7. Prove Theorem 13.3.31 using the results of the previous exercise. 

8 . Give another proof for Theorem 13.3.41 using the fact that every CF language can 
be generated by an invertible CF grammar in Chomsky normal form. 


In Exercises 9-12 the theory of recognizable forests should be applied. 

9. Prove that the languge U — V is CF if U is CF and V is a regular language. 

10. Let (p‘. X* — 7 > Y* be a homomorphism of monoids. Prove that Lip~^ G CF(X) for 
every L G CF(y). 

11. Let h{t) denote the tree which is obtained from a given tree by rewriting every op¬ 
erator a as its rank r{a). Obviously yd{h{t)) = yd(t). Show that h can be defined, 
for any given E and X, as an alphabetic tree homomorphism. Two CF grammars 
Gi and G 2 are said to be structurally equivalent if h{D{Gi)) = h{D{G 2 ))- Prove 
that there is an algorithm to determine whether or not two CF grammars are 
structurally equivalent. 

12. Prove Bar-Hillel’s pumping lemma iLemma 11.6.131) . 

13. Let G be a regular SX-grammar. Construct a CF grammar G' such that L{G') = 
T{G)r]. Note that Lemma 13.4.21 follows as a result. 
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3.6 Notes and references 


3.6 NOTES AND REFERENCES 


The basic connection between recognizable forests and context-free languages has been 
established in various ways. Mezei and Wright 182l | proved that the equational sub¬ 
sets of an algebra of hnite type (in the monoid X* these are the CF languages) are 
the homomorphic images of the recognizable subsets of term algebras, i.e., recognizable 
forests. Applied to groupoids this theorem gives the result of Exercise 5 (credited to 
D. Muller). I t als o implies Theorem 13.3.41 which was explicitly formulated by Magidor 
and Moran 
and Doner 


166ll . The proof using derivation forests goes back to Thatgher |237 , 238| | 
6611. Various forms of p roduction trees have been used in this context by 
Eng elfriet |74l ]. and Steinby 224]. Theorem 13.3.21 appears, for example, in Rounds 
215l | . It is a special instance of the fact that the inverse homomorphic images of recog¬ 


nizable subsets of algebras are recognizable (cf. Exercise 2.22). Theorem 13.3.31 ap pears 
to be well-known. The proof outlined in Exercises 6 and 7 is from Steyart [229(]. The 
idea to use tree automata in the theory of CE languages was prop osed by Rounds |214l |. 
More examples of such applications can be found in Thatgher |239| | and Engelfriet 


74l |. The results of Section 13.41 are due to Eerenci 93|. The interested reader may 


consult Eerengi 9^ for further work in this direction. 

As a conclusion we mention a few other topics. Using a r anked nonterm inal alphabet it 
is possible to dehne context-free tree grammars. Rounds lili. shows that the 


yiel d-languages of CE forests are exactly the indexed languages. Arnold and Daughet 


[H, [13, [3]) and Engelfriet and Sghmidt 85] are some further references. 

Possibilities to extend some of the results of this chapter to type 0 or context-sensitive 
languages by generalizing the t ree-c oncept have be en in vestigated by Benson 3^, But- 
telman 


3, Hart [13,1 l3|, and Revesz |2n8| |. Hierarchies of term lang uage s 


obtained by iteration of the yield-forming proce ss have b een studied by Maibaum 1701 ]. 
Engelfriet and Schmidt 85], and Turner 245 . l246| ] . Eamilies of languages dehned 
by tree recognizers b ased on algebras belonging to a give n variety of algebras were con¬ 


sidered by Steinby [2241 ]. Gecseg and Horvath [l03l ] showed that a proper variety 


may be complete in the sense that every CE language is recognizable by a hnite algebra 
of the variety (cf. the Notes and references section of Chapter [2]). 
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4 TREE TRANSDUCERS AND TREE 
TRANSFORMATIONS 


In this chapter we shall deal with systems transforming trees into trees similarly as 
generalized sequential machines transform strings into strings. There are two main 
categories of such systems: frontier-to-root tree transducers which process a tree from 
the leaves down towards the root, and root-to-frontier tree transducers which work in 
the opposite direction. Special classes of tree transducers will play a basic part in 
decomposing tree transformations into simpler ones. 

4.1 BASIC CONCEPTS 

Throughout this chapter, S, and A will stand for ranked alphabets. It will be assumed 
that whenever an operator belongs to more than one ranked alphabet, then it has the 
same rank in all of them. Moreover, X, Y and Z will always stand for (finite, nonvoid) 
frontier alphabets. 

Let us recall that Fj]{S) as defined in Section [2T] denotes the set of S-trees over the 
frontier alphabet S. Here we shall allow S' to be a possibly infinite set of trees and then 
use the notation Ts[S] for Ts(S). One can easily see that in such a case there always 
exist a ranked alphabet H and a frontier alphabet Y such that Ts[S] C Fj^{Y). 

Binary relations r C Fj^{X) x Ffi{Y) will be called tree transformations. An inclusion 
(P) O') G T is interpreted to mean that r may transform p into q. Because tree transfor¬ 
mations are binary relations, we can speak about compositions, inverses, domains and 
ranges of tree transformations as defined in Section 11.11 

With each tree transformation r C Fj^{X) x Fq(Y) we associate the translation 
{(yd(p),yd(g)) I {p,q) G r} from A* into Y*. 

The important tree transformations are those which can be given in an effective way. 
Next we define two general systems (tree transducers) inducing such transformations. 
We shall need a countably infinite set 


^ • ■ •} 


of auxiliary variables. The subset of H consisting of its first n > 0 elements will be 
denoted by i.e., En = {^i,..., f,n}- The role of an auxiliary variable is to indicate an 
occurrence of a subtree in a tree. 

If all variables occurring in a tree q are among ^i, ..., then the notation ^(.^i,..., ^n) 
may be also used for q. Moreover, if gi, ..., qn are arbitrary trees, then we generally 
write q{qi, ...,qn) for q{ii ^ gi,... ^ qn). 
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4 TREE TRANSDUCERS AND TREE TRANSEORMATIONS 


Definition 4.1.1 A frontier-to-root tree transducer (E-transducer) is a system 21 = 
{T.,X,A,n,Y,P,A') where 

(1) S and El are ranked alphabets, 

(2) X and Y are frontier alphabets, 

(3) A is a ranked alphabet consisting of unary operators, the state set of 21, 

(It will be assumed that A is disjoint with all other sets in the definition of 21, 
except A'.) 

(4) A' C A is the set of final states, and 

(5) P is a finite set of productions (or rewriting rules) of the following two types: 

(i) X a{q) {x e X, a e A, q G Fq{Y)), 

(ii) <T(ai(6), • • • ,am(lm)) ^ a(4(?i, ■ ■ ■,Cm)) (o' € m > 0, oi,..., a^,a G 

4(Cl) ■ ■ ■ ) Cm) G Ffi(Y U ^m))- 

(In the sequel we shall write simply ... ,am) for (T(ai(Ci), • • •,am(Cm))-) 

We shall use also the notation {p, q) for a production p ^ q. Moreover, if a G A is 
a state and t is a tree, then we generally write at for a{t). Similarly, if T is a forest, 
then AT will denote the forest {at | a G A, t € T}. Furthermore, for any a G A, we put 
21(a) = {Y,X,A,n,Y,P,a). 

Let us note that in the above definition it would be more exact to speak about produc¬ 
tion schemes instead of productions. Indeed, soon we shall see that they define patterns 
for rewriting trees. 

Next we define the transformations induced by F-transducers. Consider the F-transducer 
21 of Definition 14.1.11 and, for every p G Fy,[X U AH], let pr^ be the subset of AFq{Y UH) 
given as follows: 

(1) if p = aC (a G A, ^ G H), then G pr^, 

(2) if p € A U So, then aq G pr^ for all {p, aq) G P, 

(3) if p = a{pi,...,pm) {cr G S^, m > 0), then aq{qi,... ,qm) G pr^ for all 
{a{ai,... ,am), aq) G P and Oiqt G piT^ (a, a* G A, i = 1,... ,m), and 

(4) nothing is in any pr^ unless this follows from (l)-(3). 

Definition 4.1.2 Take an F-transducer 21 = (Y, X, A,D,Y, P, A'). Then the relation 
Ta = {{p, q) \ p& Te(A), g G Fn{Y), aq G pr^ for some a G A'} 
is called the transformation induced by 21. 
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4.1 Basic concepts 


For Definition 14.1.21 it would be enough to apply to trees from Fj^{X). The above 
more general case will be needed later. 

Sometimes in our proofs we should know how an input tree is transformed step by step 
into an output tree. Again, let 21 be the F-transducer of Definition 14.1.11 and consider 
two trees p, q & F^[X Li AFq{Y U H)]. It is said that p directly derives g in 21 if g can be 
obtained from p by 

(i) replacing an occurrence of an x E A in p by the right side aq of a production 
X ^ aq from P, or by 

(ii) replacing an occurrence of a subtree a{aiqi, ..., amqm) (o' E Smi oii • • •, Om S A, 
qi,...,qm G Fniy U H) in p by ag(gi,..., g™,), where fj(ai,..., a^) ag is a 
production from P. 

Each application of rule (i) or rule (ii) is called a direct derivation in 21. If g is obtained 
from p by a direct derivation in 21 (i.e., p directly derives g in 21), then we write p g. 

Therefore, is a binary relation in Fy.[X U AFq(Y U H)]. If there is no danger of 

confusion, we generally omit 21 in 

By finitely many consecutive applications of direct derivations we get derivations. 
Accordingly, for any two trees p, q € F^[X L) AFq{Y U H)] we say that 

p = Po ^ Pi ^ ^ Pi ^ ^ Pj ^ ^ Pk = q (1) 

{k > 0,pe E Fs[A U AFq{Y U S)], £ = 1,..., /c, 0 < i < j < k) 

is a derivation of g from p in 21, A; is the length of this derivation and pi ^ ^ pj 

is a subderivation of (1). In this case we write p g, or p g if 21 is understood, 
and say that p derives q in 21. Therefore, =>* is the reflexive-transitive closure of =>. 
Obviously, when p g, there could be several (but finitely many) derivations of g 
from p. However, when we write p =^* g, we usually have in mind, at least implicitly, a 
certain well-defined derivation of g from p. Consequently, we may say that p =^* g is a 
derivation. 

Using the notation the transformation rji induced by an F-transducer 21 = 
(S, X, A, Q, Y, P, A') can also be given thus: 

Ta = {{p-,q) I P G -Us(A),g E Fn{Y), p aq for some a E A'}. 

As 21 may have different productions with the same left side, there could be more 
than one g E F^{Y) such that (p,g) E t% for a given p E Fs(A), i.e., 21 is in general 
nondeterministic. However, at each step of a transformation we have only finitely many 
choices. Therefore, pra is finite for every p E Fji{X). 

A tree transformation is an F-transformation if it can be induced by an F-transducer. 
The class of all F-transformations will be denoted by F. 

Take an arbitrary set A. The zth component of a vector a E A” will be denote by ap 
i.e., a = (oi,..., On)- If oi = ... = = a then for a we write a”. If a E A^ and b E 

are arbitrary two vectors, then (a, b) will stand for (ai,..., an, bi,..., bm)- Assume that 
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k = min(m, n). Then ab stands for {aibi ,..., a^bk) or ((ai, 6 i),..., (a^, 6^)), depending 
on the context. 

Consider a p G Fy;{X U En), and let p = {pi,... ,pn) be a vector of trees. Then we 
shall write p(p) for p{pi,... ,Pn)- Moreover, if p G Fs(X U and q = (gi,..., is 

a vector of trees, then p(q) will stand for (pi(q),... ,Pm(q))- 
Consider the homomorphism p: {X U E)* —)■ E* given by xp = e {x G X) and ^ 
{(gE). Set 

Fj:{X U En) = {p G Fy,{X U En) I yd{p)ip is a permutation of ^i,... 

and 

Fy,{X U En) = {pG Fj^{x U En) I yd{p)ip = 6 • • • U}- 
Moreover, if m > 0 then let 

F^{XUEn) = {pG Fj^{XUEn)'^ \yd{pi)ip...yd{pm)(p is a 
permutation of ^i,..., 

Now let 21 = (S, X, A, D, Y, P, A') be an F-transducer, and consider a derivation 

a:p^*q {p,q G Fj:[X U AFn{Y)]). 


Let 


ripi,P2) r{pi^,p2) ^ ... ^ ?’(Pifc,P2) ^ r{pi^,p2) 

{rGF^[XuAFn{Y)UE2]) 


( 2 ) 


be a subderivation of a, where the first k direct derivation steps apply to the subtree pi, 
and then the (k + l)th step concerns the subtree p 2 - Replacing the subderivation (2) in 
a by 

ripi,P2) => r{pi,p2) r{pi^,P2) ^ ... ^ r{pi^,p2) (3) 

we obviously get a new derivation 


(3:p^* q. 

The replacement of (2) in a by (3) is called an inversion of direct derivations. Finitely 
many inversions of direct derivations is a reordering of direct derivations. 

In the sequel we do not distinguish between derivations obtained from each other by 
reorderings of direct derivations. 

Again, consider the above F-transducer 21 and a tree p G Fy,{X). Then by 
P = P{P1, ■ ■ ■ ,Pm) =^* p{aiqi, ■ ■ ■,amqm) =^* aq{qi, ...,qm) 

ip £ Dj ](A U ^fn)i Pi ^ , i — 1,..., m., pi^cii^i,, cim^m) ^ (^q) 

we mean the derivation 

p{pi,---,Pm) ^ p(pii, • • • ,Pm) ^ ^P{Plki,---,Pm) ^ 
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p{aiqi,..., amqm) =^* aq{qi, ...,qm) 

if Pi =^* aiqi is the derivation pi ^ pi^ ^ ^ pi^^ = aiqi (oj e A, qi e Fn{Y), 

i = 1 ,..., m), and p{aiqi,..., amqm) =^* aq{qi,..., qm) is obtained by replacing in 
p(ai 6 ,.. .,am^m) =^* aq by qi = . ,m). 

If we say that we write the derivation 

a:p^*aq {a e A,p £ FY,{X),q e Fq{Y)) 

in the (more detailed) form 

/3: p = p{pi,...,Pm) p{aiqi,... ,amqm) aq{qi,... ,qm) 
ip € Fj] U ^m)j Pi ^ a^qi, i — ^ aq) 

this also generally means that /3 is a reordering of a. Of course, such a reordering always 
exists. 

In the special case p = , ^m) {a £ Ym) we write j3 in the form 

/3: fT(pi,... ,pm) cr{aiqi,.. .,amqm) aq{qi, ...,qm) 

{pi =^* aiqi, i = l,...,m, {a{ai,.. .,am),aq) £ P). 

We illustrate the concepts of F-transducers and F-transformations by 


Example 4.1.3 Let 21 = (S, {x}, {oq, ai}, fl, {y}, P, {oo}), where S = S2 = {<t}, Q = 
0i = {w} and P consists of the productions x aiy and cr{ai, ai) —> oow(^i). 

Consider the tree a{x,x). One of the possible derivations 


a{x,x) ^ a{aiy,x) => a{aiy,aiy) aoUj{y) 
is illustrated by Fig. 14.11 



y 9 


UJ 


ao i 


Figure 4.1 

Thus {a{x,x),co{y)) is in rsi. In fact, consists of this single pair {a{x,x),uj{y)). 
Indeed, the only SX-tree of height 0 is x, which obviously is not in dom(r 2 i). If p G 
Fj]{X) is a tree with height greater than 1, then it should contain at least one of the 
following trees as a subtree: 

a{a{x,x),a{x,x)), a{a{x,x),x) and a(x,a(x,x)). 

One can easily see that none of these subtrees can be transformed by 21. □ 
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4 TREE TRANSDUCERS AND TREE TRANSEORMATIONS 


F-transducers transform a tree from the leaves of the tree towards the root of the tree. 
Now we define a system which works in the opposite direction. 

Definition 4.1.4 A root-to-frontier tree transducer {R-transducer) is a system 21 = 
(S, A, A, D, Y, P, A'), where 

(1) S, X, A, n, Y and A' are specified the same way as in Definition 14.1.11 but here 
A' is called the set of initial states, 

( 2 ) P is a finite set of productions (or rewriting rules) of the following two types: 

(i) ax ^ q {a G A, X e X, q £ Fn{Y)), 

(ii) aa{^i,... ,^rn) ^ q {a € A, a € m>0,q€ Fn[Y U AEm])- 

In the sequel we shall write simply aa for aa{^i,... ,^m)- Moreover, for a production 
p —>■ g we shall use the notation {p, q), too. 

Obviously, a production of type (ii) in Definition 14.1.41 can be written in the form 

aa ... ,amC") 

where a* € > 0, i = 1,..., m, ni+.. .+nm = n, and q € FQ{X[JEn). In the sequel 

we shall assume that whenever 1 < i < m and ni +... + ni_i + 1 < ii < i 2 < ni +... + nj, 
precedes in yd((?)(/?. Here ip is the homorphism defined on p. 11341 
Next we define the transformations induced by R-transducers. Let 21 be the R- 
transducer of Definition 14.1.41 For any a £ A and p £ Fy,{X) we define the subsets 
P'^a,a as follows: 

(i) if p € So U A and {ap, q) £ P then q £ pr^i^a, 

(ii) if p = o-(pi,... ,pm) (o- £ Em, m > 0 ), then for any (aa, Q'(ai^”\ ... ,amCm™)) ^ P 
and qi. £ Pim^ai. (I < i < m, 1 < j < Ui), ^(qi, ..., q^) € prst a where q* = 

(i = l,...,m), 

(iii) nothing is in any pr^^a unless this follows from (i) and (ii). 

Definition 4.1.5 Let 21 = (S, A, A, fl, Y, P, A') be an R-transducer. Then the transfor¬ 
mation induced by 21 is the relation 

'Tst = {{P, q) \ P^ FT.{X),q £ Fq{Y), q £ pT<n^a for some a £ A'}. 

A tree transformation is an R-transformation if it can be induced by an R-transducer. 
The class of all R-transformations will be denoted by TZ. 

For R-transformations we also give another definition which shows how a transforma¬ 
tion is carried out step by step. 

Let p,q £ Fq[Y U AFj]{X U H)] be trees, and consider the R-transducer of Defini¬ 
tion 14.1.41 It is said that p directly derives g in 21 if (7 can be obtained from p by 
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(i) replacing an occurrence of a subtree ax {a G A, x G X) in p by the right side q of 
a production ax in P, or by 

(ii) replacing an occurrence of a subtree aa{pi,... ,pm) {a G A, a G m > 0, 

Pi, ■■■ ,Pm & Fi:iX U E)) in p by q{pi,... ,pm) where aa ^qis in P. 

Each application of steps (i) and (ii) is called a direct derivation in 21. The relation 
expressing the direct derivation will be denoted by i.e., we write p g if g is 
obtained from p by a direct derivation in 21. Frequently, 21 will be omitted in =>21 • Any 
finite sequence of consecutive direct derivations defines a derivation. More precisely, 

p = Po ^ Pi ^ ^ Pi ^ ^ Pj ^ ^ Pk = q (4) 

{k >0,peG Th[y U AFy,{X US)], £ = 0,..., fc, 0 <i < j <k) 

is a derivation of q from p in 21, A: is the length of this derivation and p* ^ ... => Pj is a 

subderivation of (4). If q can be obtained from p by a derivation, then we write p q, 

or simply p g if 21 is understood from the context. Thus, is the reflexive-transitive 
closure of =>. Similarly as in the case of an F-transducer, we suppose that the notation 
p =>* q implies a certain derivation of q from p in 21 . 

Using the notation ^*, the transformation induced by an R-transducer 21 = 
(S, X, A, 12, Y, P, A') can equivalently be defined thus; 

'Tst = {(p, <?) I P G Fj:{X),q G Fq{Y), ap q for some a G A'}. 

Let us note that although an R-transducer 21 is generally a nondeterministic system, 
prsi is finite for every input tree p of 21 . 

Let 21 = {E, X, A,Q,Y, P, A') be an R-transducer. Consider some n > 0, a G A^, 
p G Fs(A)"^, q G Fq(Y)'^ and derivations UiPi =^* qi (i = 1,... , n). Then ap q will 
denote the vector of these derivations. Moreover, we assume that ap q implicitly 
expresses the n derivations aiPi =>* qi (i = 1 ,..., n). 

Take the above R-transducer 2t and a derivation 


a:p^*q {p,q G Fn[Y U AFji{X)]). 


Let 


ripi,P2) r'(pq,p2) ^ ... ^ r{pi^,p2) => r{pi^,P2) 
{r G Fn[Y U AFjiiX UE 2 )]) 


(5) 


be a subderivation of a, where the first k direct derivation steps are carried out in the 
subtree pi, and then in the {k -|- l)th step we apply a production in the subtree p 2 . 
Replacing the subderivation (5) in a by 


r{pi,P2) => r{pi,p2) r(pq,P2) ^ ... ^ r{pi^,p2) 


( 6 ) 


we get a derivation 


/3:p^* q. 
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The replacement of (5) in a by (6) is called an inversion of direct derivations. By 
finitely many applications of inversions we get a reordering of direct derivations. We 
shall not distinguish between derivations in an R-transducer if they are reorderings of 
each other. 

Again, take the above R-transducer 21, a state a G A and a tree p G Fs(A). Then by 
ap = ap{pi,... ,pm) .. .,3.mPZ") q{q.i ,... ,qm) 

{p G Fj:{X U Em), ap ^ 

n* > 0, i = 1,..., m, ni + ... + Um = n, q £ Fn{Y U S„), 

Fl'j • f \ 

^jPj J = ^,---,rn) 

we mean the derivation 

ap{pi, ...,pm)^* q{a.iPi\. ■ .,amP^) 
q{pil(l) 1 ai 2 Pl, ■ ■ ■ , , . . . , amiPm, ■ ■ ■ , amnmPm) . . . 

qipipki), a\2Pl, ■ ■ ■ , ai„^Pl, . . . , amiPm, ■ ■ ■ , amn^Pm) ... 

q{Pll{ki), ■ ■ ■ jPlni )> ■ ■ ■ > ^miPm, ■ ■ ■ , amn^Pm) ■ ■ ■ 

q(Pll(ki), ■ ■ ■ jPlnRki^^), ■ ■ ■ ,Pmi(kmU^ ■ ■ ■ i^^mn^Pm) • • • 

d{Pll(kl)l ■ ■ ■ ) ) ) • • • }Pmi(kmi)’ ■ ■ ■ ’P'm-nmikmn^) ~ 

q{qii ,■■■, qini , • • •, 4mi, • • •, ), assuming 

that ajp”‘ Ri (1 < i < m) has its component derivations 

aijPi => Pi.(1) ^ ... ^ PiRki.) = qij (% e 4 = 1, • • •, ni), 

and ap{pi,... ,Pm) 4 (aip”\ ..., amP^) is obtained by replacing (i = 1 ,..., m) in 
ap g(ai^”i,... by p*. 

When we say that we write the derivation 

a:ap^*q {a £ A,p £ Fj:{X),q £ Ffi{Y)) 

in the more detailed form 

13: ap = ap{pi,... ,pm) g(aip”i,..., a^p”™) g(Ri,...,Rm) 

(p G Ts iyX U ■^m), ap ^) • • • ) ^m^riF )i a* G A ®, Tli ^ 0, 

i = 1,... ,m, ni + ... + Um = n, q £ Fn{Y U S„), ajpJ" ^ r^, j = 1,... ,m), 

it generally also means that /3 is a reordering of a. Obviously, such a reordering always 
exists. 

In case p = c(^i,... ,^m) (c G E^m), we write f3 in the form 

/3: aa{pi,...,pm) ^(aip^i,..., a^p”"*) g(Ri,...,Rm) 

{{aa,q{ai^'l\ .. .,amC^)) G P, a* G A"-% n* > 0, i = 1,... ,m, ni -k ... + = n, 

q e Fn{Y U En), a^pj" r^, j = 1,..., m). 
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Example 4.1.6 Let 21 = (S, {x}, {oq, oi, 02 }; {yi, 2 / 2 }) -P, oo) be the R-transducer, 
where S = Si = {cr}, 12 = Hi U 122, = {<^i} and 122 = {^^ 2 } and P consists of the 

productions 

agcr —> W 2 (ai^i, 02^1), 
aidwi(ai^i), a2a ^ ui{a2^i), 
aix yi, a2X ?/2- 

Consider the trees p = a{a{a{x))) and q = a; 2 (wi(wi(yi)),a;i(a;i(?/ 2 )))- Then a deriva¬ 
tion of q from aop is illustrated in Fig. 14.21 



Figure 4.2 

By induction on the heights of input trees one can easily prove that 
'Tst = {(o-’"(a;),W2(wi"nyi)>a;i“^(y2))) | n = 1,2,...}, 
where ct‘^(C) = C and d"'(^) = if n > 0. □ 

Both F-transducers and R-transducers generalize generalized sequential machines from 
strings to trees (or from unary polynomial symbols to polynomial symbols of arbitrary fi¬ 
nite type if strings are interpreted as unary polynomial symbols, as we did in Section 
At the same time there are the following main differences between F-transducers and 
R-transducers: 

(1) An F-transducer hrst processes an input subtree nondeterministically and then 
makes copies of the resulting output subtree. 
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(2) An R-transducer can first make copies of an input subtree and then process each 
copy independently in a nondeterministic fashion. 

(3) F-transducers should process even those subtrees which are deleted afterwards. 

Before ending this section we state and prove some simple general results. 

The concept of tree homomorphism was introduced in Section 12.41 It is easy to see 
that the tree homomorphism h: Fy,(X) Fq{Y), given by the mappings 

hm-Em^ Fn(Y U Em) {m > 0) 


and 

hx-.X^Fn{Y), 

can be induced by the one-state F-transducer 21 = (S, X, {a}, 0, Y, P, a) where 

P = {x —>■ ahx{x) I X € A} U {cr{a,... ,a) ^ ahm{(x) \ a G m > 0}. 

Definition 4.1.7 A one-state F-transducer 21 = {Tj,X,{a},Q,Y,P,a) is an HF- 
transducer if for every x G A, resp. a G S, in P there is exactly one production 
with left side x, resp. iT(a,..., a). 

We have seen that every tree homomorphism can be induced by an HF-transducer. 
The converse is also true: transformations induced by HF-transducers are tree homo- 
morphisms. 

We now introduce the R-transducer counterpart of HF-transducers. 

Definition 4.1.8 A one-state R-transducer 21 = (B, A, {a}, A, P, a) is an HR- 
transducer if for each d G A U B in P there is exactly one production with the left 
side ad. 

Next we prove that the class of all tree homomorphisms coincides with the class of all 
transformations induced by HR-transducers. 

Theorem 4.1.9 The class of transformations induced by RF-transducers coincides with 
the class of all transformations induced by RR-transducers. 

Proof. Let 21 = (B, A, {o}, H, A, P, a) be an HF-transducer. Consider the R-transducer 
® = (B, A, {a}, H, A, P',a), where P' is given in the following way: 

{ax,q)€P' {x,aq) € P (x G A) 


and 


(acr,g«i,.. .,af,m)) G P' ia{a, ..., a), ag) G P (u G Ym,m > 0,g G Fn{Y\JEm)). 

It is obvious that 23 is an HR-transducer. 
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By induction on hg(p), we show that for an arbitrary p G Fy.{X) and q € Fq(Y) the 
equivalence 

ap q p aq (7) 

holds. This obviously implies 

If hg(p) = 0, then (7) holds by the definition of P'. 

Let p = cr{pi,... ,Pm) (o' £ P'm, m > 0), and assume that (7) has been proved for all 
trees in Fy.{X) with heights less than hg(p). 

Suppose that the left side of (7) holds, i.e., we have ap = aa{pi,... ,pm) 
q{api,... ,apyn) = q, -where (au, g(a^i,..., a^m)) G P' and api 

qi {i = Then, by the definition of P', the production cj(a, ...,a) —>■ 

aq{^i,... ,^m) is in P. Moreover, by the induction hypothesis, pi aqi is valid for 
each z (1 < z < m). Therefore, we have a desired derivation 

p = a{pi,... ,pm) (^iaqi,.. .,aqm) aq{qi, ...,qm) = aq. 

The fact that p aq implies ap =^3 q can be shown by reversing the above argument. 

To see that every HR-transformation is induced by an HF-transducer, it suffices to 
observe that every HR-transducer IB arises from an HF-transducer 21 by the above 
construction. Hence HR- and HF-transducers appear in equivalent “associated” pairs. 

□ 


We prove two more results. 

Theorem 4.1.10 The following statements hold. 

(i) For every F-transformation r C Fy;{X) x Ffi(Y), dom(T) G Rec(S,X). 

(ii) There exists a tree homomorphism h: Fj]{X) FniY) such that range(/z) ^ 
Rec(H,y). 

Proof. In order to show (i) consider an F-transducer 21 = {Ti,X,A,Fl,Y,P,A'). Con¬ 
struct an NDF SX-recognizer B = where B = (H, S), B' = A', and, for all 

m > 0, £7 G Tim and oi..., Om G H, 

£T®(ai, .. .,am) = {a | (3g G F^{Y U Hm))((o'(ai, .. .,am),aq) G P)}. 

Finally let 

x/3 = {a e A \ {3q G FQ{Y)){{x,aq) G P)} {x G X). 

We end the proof of (i) by the observation that for all a G H and p G F^{X) the 
equivalence 

aGp/3 {3q G Fn{Y)){p aq) 

holds. This can be shown by induction on hg(p). 

For a proof of (ii), see Example 12.4.151 □ 

Example 12.4.151 shows also that the translation of a context-free language by a tree 
transducer is not always context-free. In fact, in this example the finite language {x} is 
translated into the non-CF language | zz > 0}. 
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Lemma 4.1.11 For each T G Rec(S,X) there exists an F-transducer 21 such that 
dom(T 2 i) = range(r 2 i) = T and rai is the identity mapping ofT. 

Proof. Let B = (B, (3, B') be an F SX-recognizer with B = {B, S) and T(B) = T. Take 
the F-transducer 21 = (S, X, B, S, X, P, B') where 

P = {x ^ I3{x)x\x e X}U {a{hi,... ,bm) .. ,irn) \ 

m P 0, (T G b, bi,..., bm ^ B, <t (6 i,..., bm) — b}. 

Obviously, 21 has the desired properties. □ 

We end off this Section with 

Definition 4.1.12 Two R- or F-transducers 2t and are equivalent if = rtg holds. 

4.2 SOME CLASSES OF TREE TRANSFORMATIONS 


In this section we shall define several classes of F- and R-transformations and then 
compare them with each other with respect to set theoretic inclusion. It will turn out 
that in most cases the classes to be investigated are incomparable. 

Definition 4.2.1 Let 21 = {E, X, A,Q,Y, P, A') be an F-transducer. Then: 

(1) A production of 21 is linear if each auxiliary variable occurs at most once in its 
right-hand side. Moreover, 21 is a linear F-transducer (LF-transducer) if all of its 
productions are linear. 

(2) 21 is a totally defined F-transducer fiFF-transducer) if 

(i) for each x G A there is a production in P with left-hand side x and 

(ii) for all m > 0, fj G and oi,..., G A there is a production in P with left-hand 
side a{ai,.. .,am)- 

(3) 21 is a nondeleting F-transducer (NF-transducer) if for every production 
a{ai ,..., am) aq {a € m > 0) from P each G Em occurs at least once in 

Q- 

(4) 21 is a deterministic F-transducer {DF-transducer) if there are no two distinct pro¬ 
ductions in P with the same left-hand side. 

(5) 21 is an F-relabeling if each of its productions is of the form 

(i) X ^ ay {x G X,a € A,y € Y) or 

(ii) fT(ai,... ,am) aw(^i,... ,(m), where a G Ym,ai, ... ,am,a € A,u} € Dm- 
Transformations induced by F-relabelings are also called F-relabelings. 


142 



4.2 Some classes of tree transformations 


To illustrate the above concepts, let us take the following example. 

Example 4.2.2 Let 21 = (S, {x}, {oq, oi}, 0, {?/}, P, {ai}) be the F-transducer with S = 
T ,2 = {< 7 } and n = O 2 = {w}, where P consists of the productions 

X aoy, 

a{ao,ao) ^ 2 ), cr(ao, “i) “0^(6,6), cr(ai, uq) 

(T(ai,ai) aia;(6,6)- 

Then 21 is a linear, totally defined, nondeleting and deterministic F-transducer. More¬ 
over, 21 is an F-relabeling. □ 

Example 1 4.1.3 1 gjves an F-transducer which is linear and deterministic, but it is neither 
totally dehned nor nondeleting. 

Let us note that F-relabelings are always linear and nondeleting F-transducers. 

We now define the R-transducer counterparts of the above classes of F-transducers. 

Definition 4.2.3 Let 21 = {Ti, X, A,fl,Y, P, A') be an R-transducer. Then: 

(1) A production of 21 is linear if each auxiliary variable occurs at most once in its 
right-hand side. Moreover, 21 is a linear K-transducer (LH-transducer) if all of its 
productions are linear. 

(2) 21 is a totally defined R-transducer (TK-transducer) if 

(i) for all o € A and x € X there is a production in P with left-hand side ax, and 

(ii) for all o € A and a € Sm (w- P 0) there is a production in P with left-hand side 
aa. 

(3) 21 is a nondeleting K-transducer (NR-transducer) if for every production aa ^ q 

(a G Sm, m > 0) from P each occurs at least once in q. 

(4) 21 is a deterministic R-transducer {DR-transducer) if A' is a singleton and there 
are no distinct productions in P with the same left-hand side. 

(5) 21 is an R-relabeling if each of the productions of 21 has the form 

(i) ax ^ y {a & A,x & X,y £ Y) or 

(ii) aa —> uj{ai^i, ..., am^m), where a, oi,..., € A, a £ ui £ Dim- Transforma¬ 

tions induced by R-relabelings will also be called R-relabelings. 

Example 4.2.4 Let 21= (S, {x}, {uq, oi}, n, {yi, 7 / 2 }) R, {ao}) be an R-transducer with 
S = S 2 = {o'} and 0 = ^2 = {w}- Moreover, P consists of the productions 

aox yi, aix -£■ y 2 , 

aga a;(ai^i, 01 ^ 2 ), aio- w(ao6, “o^)- 

Then 21 is a linear, totally defined, nondeleting and deterministic R-transducer. More¬ 
over, 21 is an R-relabeling. □ 
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The R-transducer of l4.1.6l is deterministic and nondeleting, but it is neither linear nor 
totally defined. 

Let us note that R-relabelings are linear and nondeleting R-transducers. 

The abbreviations introduced above for classes of tree transducers can be combined to 
indicate further subclasses. For instance, an LNF-transducer is a linear nondeleting F- 
transducer. Moreover, a transformation is a K-transformation if it can be induced by a K- 
transducer. The class of all K-transformations will be denoted by 1C. Thus, for example, 
CMJ- is the class of all LNF-transformations, i.e., the class of all transformations induced 
by linear nondeleting F-transducers. By Theorem 14. 1.91 we shall write simply % instead 
of RF and RTZ. Moreover, T'rel, resp. T^rel, will denote the class of F-relabelings, resp. 
R-relabelings. 

We now prove 

Theorem 4.2.5 F and TZ are incomparable. 

Proof. In order to prove Theorem 14.2.51 we give (i) an F-transformation which is not 
in TZ and (ii) an R-transformation which cannot be induced by any F-transducer. 

(i) Consider the LDF-transducer 21 of Example 14.1.31 If for an R-transducer 25 = 

{T,,{x}, B,Q, {y}, P', B') we have {a{x, x),Lo{y)) E r^s, then at the first step of a 
derivation ba{x, x) ^(y) € B') we should apply a production of the form 

ba — > b'^i,ba — )• b'^ 2 ,ba — >• uj{b'^i),ba oj{b’^ 2 ) or ba —>• w(y), where b' E B. In 
each of the above cases one of the auxiliary variables and ^2 is deleted. Therefore, 
dom(r 5 s) is infinite. 

(ii) Take the DR-transducer 21 of Example 14.1.61 Assume that an F-transducer 
= {E, {x}, B,D, {yi,y 2 }, P', B') induces rsi. Obviously, P' should then contain a 

production of the form 

a{b) 61 ^ 2 ( 91 , 92 ) ( 6,61 E B). 

We may confine ourselves to the following cases: 

(I) 91 = a^iyi) and 92 = cr^(y 2 ), 

(II) 91 = o-'(^i) and 92 = cr’^{y 2 ), 

(III) 91 = a^{yi) and 92 = 0-^(6), 

(IV) 91 = and 92 = o-''( 6 )- 

Obviously, in a derivation cr^(x) b'uj 2 {u}[ ^{yi),ujl ^( 1 / 2 )) {r > 1,6' E B') the last 
application of the above productions can be followed by applications of productions of 
the form a(b) —>■ 61^1 ( 6,61 E B) only. Let t be the maximum of exponents in (I)-(IV). 
If r > t-|- 1 and T(q{F{x)) = 012 (w]"”^(?/,), a;]’“^(yj)) (1 < i,j < 2) then i = j. □ 

From the proof of Theorem 14.2.51 we directly get 

Corollary 4.2.6 DF and VTZ are incomparable and so are VF and TZ, and F and VTZ. 

□ 
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4.2 Some classes of tree transformations 


As we have mentioned one of the main differences between F- and R-transducers is 
that while F-transducers hrst process an input subtree and then copy the resulting 
output subtree, R-transducers first copy an input subtree and then treat these copies 
independently. In the case of an LR-transducer none of the input subtrees of a tree is 
copied during the translation of the tree. This property leads to 

Theorem 4.2.7 CTZ is a proper subclass of CJ-. 

Proof. By (i) in the proof of Theorem 14.2.51 LT is not a subclass of CR,. Thus, it is 
enough to show the validity of CR, C CT. 

Let 21 = (S, X, A, fl, Y, P, A') be an LR-transducer. Then the productions from P can 
be written in the form 

(i) ax ^ g (a G A, X G AT, g G Fb(F)), or 

(ii) ■ • ■,am(.m) (a, oi,..., G A, m > 0, a G G Fn[Y U 

A^rn])- 

Now take the following R-transducer 21. If 21 is nondeleting, then 21 = 21. In the 
opposite case 21 = (Y, X, A,fl,Y, P, A') is given as follows. Let A = AU {*} (* 0 A). 
Fix any y ^ Y and enlarge P by all productions *x —)■ y (x G X) and *a ^ y {m > 
0, cr G Ym)- Denote by P the resulting set of productions. Obviously, 21 is linear and 
equivalent to 21. The only difference between 21 and 21 is that 21 transforms (in state 
*) even those subtrees of a tree p G Fj]{X) which are deleted during the corresponding 
derivation of p in 21. 

Next, construct the F-transducer IB = {Y,X,B,f},Y,P',B'), where B = A and B' = 
A'. Moreover, given any x & X,b & B and q G Fq{Y),x —)• bq is in P' iff 6x —>■ g is in P. 
Furthermore, the production 

(t(6i,..., 6m) y bq(^^i ,..., ^m) £ Llm, m P 0,bi,, b^yi, b B, q ^ Fq(R U .^m)) 

is in P' iff P contains a production 

ba q{cif,i 


such that for each z = 1,..., m. 


J Cj if f,i occurs in q, 
( * otherwise. 


Obviously IB is linear. 

In order to complete the proof of Theorem 14.2.71 it is enough to show that the equiv¬ 
alence 

p bq bp q (1) 

holds for all 6 G G Fy:{X) and q G Fq{Y). We shall proceed by induction on hg(p). 
If hg(p) = 0, then ([1]) obviously holds by the definition of P'. 
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Now let p = a{pi,... ,Pm) (<7 € Sm, m > 0), and assume that ([I]) has been proved for 
all trees in Fy,{X) of lesser height. 

(I) Let p bq hold. More in detail, let 

p = a{pi,... ,pm) cr{biqi,... ,bmqm) =^<B bq{qi, ...,qm)=bq 

where pi biqi (i = 1,... ,m). Then by the induction hypothesis, we have biPi =>|- 
qi {i = 1,..., m). Moreover, by the definition of P', ba q{bi^i, ..., bmim) is in P. 
Therefore, 

bp = ba{pi ,... ,pm) qibipi,bmPm) qiqi, ■■■,qm) = q 

also exists in 21. 

(II) Assume that in 21 we have a derivation 

bp = ba{pi,... ,pm) => q{bipi,bmPm) =>* q{qi, ■..,qm) = q 

where each g'j (z = 1,..., m) is obtained by a derivation biPi =>* qi in 21. Moreover, let 
bi = * and qi = y it^i does not occur in q. Then a{bi, ..., bm) —>■ bq is in P'. Furthermore, 
by the induction hypothesis, there are derivations pi biq^ (z = 1,..., m). Therefore, 
the derivation 

p = Z7(pi,... ,pra) (y{biqi ,.. .,bmqm) bq{qi, ...,qm)=bq 


is also valid. □ 

For linear nondeleting tree transformations we have the following stronger result. 
Theorem 4.2.8 CAfTZ = CNF. 

Proof. The LF-transducer 25 constructed to the LNR-transducer 21 in the proof of the 
previous Theorem is obviously nondeleting. 

Conversely, let Gl = {E,X,C,Q,Y, P",C') be an arbitrary LNF-transducer. Construct 
the R-transducer 21 = (S, X, C, Y, P, C'), where P is defined as follows: 

{ax, q) G P (x, aq) € P" 


and 


{aa, q{ai^i, ■ ■ ■, am^m)) G P I' • • •) o,m)^ • • • j ?m)) G P , 

where x G X,a,ai,..., am G A, a G {m > 0) and q G Fn{Y U Em)- Obviously, 21 is 
an LNR-transducer. Now to 21 construct the F-transducer 25 as in the proof of Theorem 
KTTl Then 25 = (£. □ 

The LF-transducer 25 constructed to an R-relabeling in the proof of Theorem 14.2.71 is 
obviously an F-relabeling. Moreover, the R-transducer 21 given to an F-relabeling C in 
the proof of Theorem 14.2.81 is an R-relabeling. Thus, we have 
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Corollary 4.2.9 J^rel = T^rel. □ 

According to Corollary 14.2.91 we may speak simply about relabelings. 

One can easily show the existence of an LNF-transformation which is not a relabeling. 
Our comparison results can be summarized by the diagram in Fig. 14.,11 



Figure 4.3. 


4.3 COMPOSITIONS AND DECOMPOSITIONS OF TREE 
TRANSFORMATIONS 

Let /C be a class of tree transformations. We say that JC is closed under composition if rio 
T 2 G JC whenever ri, T 2 € JC. As we shall see, some of our classes of tree transformations 
are closed under composition while others are not. On the other hand, in many cases it 
is possible to decompose a tree transformation into a composition of simpler ones. 

For any two classes /Ci and JC 2 of tree transformations, we introduce the notation 
/Cl o /C 2 = {ri o r 2 I Ti € JCi,T 2 G JC 2 }. Using this notation, the closure of a class JC of 
tree transformations under composition can be expressed by the inclusion JC o JC (G JC. 
Similarly, the fact that all transformations in JC can be given as compositions of a 
transformation in JCi by a transformation from JC 2 can be expressed by /C C /Ci o /C 2 . 
Finally, if /C is a class of tree transformations, then let JC^ = JC and JC^ = JC o JC^~^ 
(n > 1). All of the classes defined in the previous section {TZ,F,CF,% etc.) include all 
identity transformations {(t, t) | t G Fs(A)}. Hence, if JC is any one of these classes, 
then we know that 

JCCJC'^CJC^C.... 

First we prove a decomposition theorem concerning F-transformations. 

Lemma 4.3.1 J- C CJ- o fi and T C CIZ o ff. 
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Proof. Let 21 = (E, X, A, A, Z, P, A') be an arbitrary F-transducer. Arrange the pro¬ 
ductions from P in a fixed order and number them from 1 to |P|. For all i{= 1,..., |P|), 
if the left side of the Ah production is x E A, then let be a new letter. Denote 
by Y the set of all such x^*\ Moreover, for all i{= 1,..., |P|), if the symbol a E 
(m > 0) occurs in the left-hand side of the Ah production, then will be a new m-ary 
operator. The set of all such operators will be denoted by D. 

Now we introduce the F-transducer = (S, X, A, D, Y, P', A'), where P' is defined as 
follows; 

(i) X ax^*) (x E A, a E A) is in P' iff the ith production in P is x —>■ ar for some r, 

(ii) ct(oi, ... ,Om) afTW(^i,... {o- E Ym,m >0, ai,... ,Om E A) is in P' iff the 

Ah production in P is a{ai ,..., am) —>■ ar for some r. 

Obviously, is linear and nondeleting. Thus, by Theorem 14.2.81 r® is a linear nondelet¬ 
ing R-transformation, as well. 

Next define the F-transducer C = (D, Y", {cq}. A, Z, P", cq) in the following way: 

(i) x^*^ —)■ cqv is in P" iff the ith production in P is x —>■ or, 

(ii) ..., Co) —7> Cor is in P" iff the Ah production in P is 0 ( 01 ,..., am) ar. 
Then (t is an HF-transducer. 

We prove that = r® o T(r. For this it is enough to show that, for all p E Ps(A), 
r E Pa(Y) and a € A, the equivalence 

p^^ar {3q e Fn{Y)){p aq Aq Cor) (1) 

holds. We proceed by induction on hg(p). 

If hg(p) = 0, then ([1]) obviously holds. 

Assume that p = cr{pi,... ,Pm) {a E m > 0) and that ([I|) has been proved for all 
trees from Fy,{X) of lesser height. 

(I) Let 

p cr{airi,.. .,amrm) ar(ri, ...,rm) = ar, (2) 

where pi a^ri {ri E Pa(A)) holds for each i{= 1,... ,m). Then, by the induction 
hypothesis, there are trees qi E Fq{Y) (i = 1,... ,m) such that pi aiqi and qi cor* 
hold. Assume that the production fT(ai,..., am) ar last applied in ([2]) is the Ah one 
in P. Then 

{a{ai,... ,am),aa^''^Ci,... ,^m)) € P' and (cq, ...,cq), cor) E P". 
Therefore, taking q = cr^^^qi ,..., qm), we have the desired derivations 
a{aiqi,...,amqm) =^'B aa^'-^qi,... ,qm) = aq 

and 

q =^* a(*)(cori,.. .,Corm) Cor(ri, ...,rm)= Cor. 

(II) The fact that the right side of ([T]) implies its left side can be proved by inverting 

the above computation. □ 
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Lemma 4.3.2 F 0 % F. 

Proof. Let 21 = (S, X, P, P, A') be an F-transducer and = 

{bo}, A, Z, P',bo) an HF-transducer. We shall construct an F-transducer € 
whose productions will be composed of productions of 21 and derivations in 23. For this, 
using the fact that derivations in 23 can be started from trees in Po[y U boE] (see p. 
□SSI), we dehne derivations in 23 for trees in Fq(Y U Take two trees q G Fci{Y U 
and r G F^{Z U Em)- We write q bor if 

q{bo^i, ... ,6oCm) bor 

holds. Now dehne an F-transducer Gi = {T,, X, A, A, Z, P", A'), where P" is given as 
follows; 

(i) X ^ ar {x G X, a G A, r G F,\{Z)) is in P" iff there is a production x ^ aq in P 
such that q bor holds, 

(ii) cr(ai,... ,am) ar {a G Ym,m > 0, ai,..., a™,, a G r G F/^{Z U Em)) is in P" 
iff there is a production cr(ai,..., am) —>■ ag in P such that q bor holds. Since 
at each step of the transformation of a tree the number of applications is hnite, 
P" is hnite. 

We prove that for all a G G Ps(X) and r G F/\{Z) the equivalence 

p^lar {3qG Fn{Y)){p^^aqAq^^bor) (3) 

holds. We proceed by induction on hg(p). 

If hg (p) = 0 then ([S]) obviously holds. 

Assume that p = (j{pi,... ,Pm) (o' G m > 0) and that (l3|) has been proved for all 
trees from Ps(X) of lesser height. 

(I) First we show that the right side of ([3]) implies its left side. For this assume that 
the derivations 

p a{aiqi,.. .,amqm) aq{qi, ...,qm) = aq 

{pi atqi, i = l,...,m) 

and 

q q{bori, • • •, borm) bor{ri ,..., r^) = bor 
(qi bori,i = 1,..., m) 

are given. Then, by the induction hypothesis, the relations pi Ojri (i = 1,... ,m) 
also hold. Moreover, by the dehnition of P", < 7 ( 01 ,... ,Om) ^ of is in P". Thus, we 
have the derivation 

p cr{airi,.. .,amrm) ar(ri, ...,rm) = ar. (4) 

(II) Suppose that (jl]) and the derivations pi a^rj (i = 1,..., m) are valid. Then, by 
the induction hypothesis, there are trees qi G Ffi(Y) (i = 1,... ,m) such that pi aiqi 
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and Qi boVi hold. Moreover, by the definition of P", there exists a q € Fq{Y U Em) 
with (cr(ai,..., am), aq) € P and q boY. Therefore, for q = q{qi ,..., qm) 

p a{aiqi,... ,amqm) ■■■,qm) = aq 


and 

q qibon,boTm) bor{ri ,..., r^) = bor 
hold. □ 

From Theorem 14.2.71 and the Lemmas 14.3.11 and 14.3.21 we directly obtain 

Theorem 4.3.3 F = CP oR = CTZ o R. 

The constructions in the proofs of Lemma [4 .3.1 1 and 14 .3.2 1 oreserve determinism 
we have 

Corollary 4.3.4 VP = CVP oR. □ 

Now we investigate some special classes of F-transformations for closure under com¬ 
position. 

Lemma 4.3.5 Let 21 = (S, X, A, fl, T, P, A') be an F-transducer. Then there exists a 
totally defined F-transducer 23 = {E, X, B,D,Y, P', B') such that = r<g. Moreover, if 
21 is linear, then 18 can be chosen linear, too. 

Proof. Let B = AO {*} and B' = A'. The required *8 results if we put 

P' = PVJ [x ^ *y \ x ^ X,y ^Y}0 {a{bi,... ,bm) ^ *y \ a G Em, 
m > 0,bi,...,bm G B,y G T}. 

If 21 is linear, then so is *8. □ 

Theorem 4.3.6 The following equalities hold: 

(i) CP o CP = CP, 

(ii) cnocn = cp. 

Proof. In order to show (i), take two LF-transducers 21 = {E,X,A,Fl,Y,P,A') and 
*8 = {D,Y,B,A,Z,P',B'). In view of Lemma 14.3.51 we may assume that 18 is totally 
defined. Construct an F-transducer (f = {E,X,C,A,Z,P'',C') with C = A x B and 
C' = A' X B' . Furthermore, P" is defined as follows: 

(I) X —)• (a, h)r {x G X, (a, b) G C,r G F^{Z)) is in P” iff there is a production x ^ aq 
in P such that q br holds. 


□ 

. Thus, 


150 














4.3 Compositions and decompositions of tree transformations 


(II) cr((ai, 6i),..., {am, hm)) (a, b)r 

(fJ € 'ff‘m, ^ ^ 0) (®1) ^1)) • • • ) {o>m, bm'), (®) € C,T € F/^{Z U i^m)) 

is in P" iff there is a production (T(ai,..., am) —> aq in P such that • • •, bm^m) 

br holds. 

We shall prove that for arbitrary p E Fj]{X),r E F^{Z) and (a, b) £ C the equivalence 

p^l{a,b)r {3q £ Fn{Y)){p aq A q br) (5) 

holds. We proceed by induction on hg(p). 

If hg(p) = 0, then ([5]) obviously holds. 

Now let p = cr{pi,... ,Pm) (<7 E Sm, m > 0), and assume that ([5]) has been proved for 
all trees of lesser height. 

First we show that the right side of ([5]) implies the left side. Suppose we are given 
derivations 

p a{aiqi,... ,amqm) ^2t al{(li, ■■■,qm) = aq 

and 

q q{biri ,..., bmTm) br{ri, ...,rm) = br 

where pi aiqi and qi 6jrj (i = 1,... ,m). (Observe that for each i {1 < i < m) 
there exists an r* such that qi biVi holds since 53 is totally defined.) Then, by 
the induction hypothesis, the derivations pi {ai,bi)ri {i = 1 ,... ,m) are also valid. 
Furthermore, by the definition of P", the production 

(t((oi,6i), ..., {am,bm)) -A {a,b)r 

is in P". Therefore, we get the derivation 

p cr{{ai, bi)ri ,..., {am, bm)rm) (“, b)r{ri ,..., r^) = {a, b)r. 

The fact that the left side of ([5]) implies its right side can be shown by reversing the 
above argument. 

In order to prove (ii) it is enough to note that the HF-transducer £ constructed to the 
LF-transducer 21 in the proof of Lemma l4.3.1l is also linear. Moreover, by Theorem l4.2.7l 
the inclusion CR C CP holds. □ 

Using an argument similar to that used in the proof of Theorem 14.3.61 (i), one can 
prove 

Theorem 4.3.7 The classes VF and P are closed under composition. □ 

From Theorem 14.3.71 by Theorem 14.3.61 fii. we get 
Corollary 4.3.8 The class CDF is closed under composition. □ 
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Using our decomposition results, one can prove 
Theorem 4.3.9 T o VT = T. □ 

Now we turn to decomposition of R-transducers. 

Lemma 4.3.10 TZ ^Ro CTZ. 

Proof. Let 21 = (S, X, A, A, Z, P, A') be an arbitrary R-transducer. Let n be the greatest 
integer with ^ 0. For any production d £ P and natural number i (1 < i < n), 
denote by k(d, i) the number of occurrences of in the right-hand side of d. Set k = 
max{fe(d, i) | d € P, i = 1,... ,n}. Furthermore, take the ranked alphabet given by 
^ I "1 > 0) and Orn-k = {(t' \ a £ (m > 0). 

Let = (E, X, {bo},0, X, P',bo) be the HR-transducer where P' consists of all pro¬ 
ductions 

box X {x £ X) 

and 

boa -£ ,..., (cj G m > 0). 

Next define an LR-transducer C = (11, X, A, A, Z, P”, A'), where P" is given as follows: 

(i) ax ^ r {x £ X) is in P" iff it is in P. 

(ii) Let a £ Em {m>R) and G with = i{i-i)k+j (i = 1, • • •, m, j = 1,..., /c). 

Then aa' —)■ r(ai^^,... ,am^rn) is in P" iff aa —)■ r(ai(^”\ ... is in P (for 

some ni,..., rim)- 

For each p £ Fs(A) let us denote by p' £ Pq(X) the tree given as follows: 

(I) if p = X G X, then p' = x, 

(II) if p = a(pi,... ,pm) (o- £ Em,m > 0), then p' = a'{pf,... ,p'^). 

It is easy to show that the transformation rgs is exactly the mapping p ^ p' (p £ 

Fj:{X)). 

In order to prove rji = r® o re it is enough to show that for all a G G Fj:{X) and 
r £ Fa{Z) the equivalence 

ap r <;=> ap' =>£ r (6) 

holds. We proceed by induction on hg(p). 

If hg(p) = 0 then, by the choice of P", ([6]) is obviously valid. 

Now let p = a{pi, ... ,Pm) (<7 G Em, m > 0), and assume that dH) has been proved for 
all trees of lesser height. 

First we prove that the left side of ([6]) implies its right side. Assume that 
ap ^2t r(ri,..., r^) = r 
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where ajp”* ~ 1, ■ ■ ■ ,m). Then, by the definition of P", the production 

aa' —>■ r(ai^;^,..., is in P". Moreover, by the induction hypothesis, there are 

derivations =>2 r* for all i{= 1,... ,m). Therefore, we have the desired derivation 

ap' r{aip ^\..., amp'Z”") ^(ri,..., r„) = r. 

The fact that the right side of Q implies its left side can be proved by the converse 
of the computation above. □ 

Lemma 4.3.11 % oTZ PfZ. 

Proof. Let 21 = (S, X, {oq}, T, P, uq) be an HR-transducer and iB = (0, Y, B, A, Z, P', 
B') an arbitrary R-transducer. Take the R-transducer = (S, X, B, A, Z, P", B'), where 
P" is given in the following way: 

(i) bx^r{bGB,x€X,r£ F^{Z)) is in P" iff there is a production a^x q in P 
such that bq r holds; 

(ii) ba ^ r {b & B,a & m > 0, r G Fa[Z U BEm]) is in P" iff there is a production 
aocr q{ao^i,..., ao^m) (q G Po(T U Em)) in P such that bq r holds. 

To show T%o = T(t it is enough to prove that for arbitrary b & B,p £ Fy,{X) and 
r G Fa{Z) the equivalence 

bp^lr {3q£ Fn{Y)){aop^^qAbq^’^r) 

holds. This can be carried out by induction on hg(p). □ 

From Lemmas 14.3.lUI and 14.3.111 we directly get 

Theorem 4.3.12 TZ = P o CP. □ 

Using Theorems 14.3.31 and 14.3.1^ we obtain 

Theorem 4.3.13 For each n > 1 the inelusions P” C and C hold. □ 

Taking n = 1 in Theorem 14.3.131 we see that every F-transformation can be given as 
the composition of two R-transformations, and each R-transformation can be obtained 
as the composition of two F-transformations. Thus, taking Theorem 14.2.51 into account, 
we get 

Corollary 4.3.14 Neither P nor P is closed under composition. □ 

One can show that P is not closed under composition by LNF-transformations either. 
For P, we have 

Theorem 4.3.15 P o CMP = P. 
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Proof. By Theorem 14.3.121 it suffices to show that CTZ is closed under compositions by 
LNR-transformations. 

Let 21 = (S, X, A, D, Y, P, A') be an LR-transducer and = (fl, Y, B, A, Z, P', B') an 
LNR-transducer. Take the R-transducer Ci = {T,, X,C, A,Y, P",C') with C = A x B 
and C = A' X B'. Moreover, P" is given as follows: 

(i) (a, b)x —>■ r ((a, h) G C, x £ X,r £ Fa.{Z)) is in P" iff there is a production ax ^ q 
in P such that bq r holds. 

(ii) {a,b)a ^ r{{ai,bi)ii 

((a,6),(ai,6i),...,(a 

m •} ) £ C ^ G £ Tfi ^ 0, r G TA \Z U C ) 

is in P” iff there is a production aa —>• q{ai^i ,..., am^m) (q € Fq(Y U Hm)) in P 
such that bq r(6i^i,..., bm^m) holds. 

In order to show tc = t%o rgs it is enough to prove that for arbitrary (a, b) £ C,p £ 
Fj:{X) and q £ F^{Z) the equivalence 

{a,b)p^*^r {3q£Fn{Y)){ap^*^q^bq^*^r) 

holds. This can be done by induction on hg(p). □ 

Later on we need the following results. 

Lemma 4.3.16 Let r C Fj^{X) x Fq{Y) be an arbitrary F-transformation and T £ 
Rec{n,Y). Then Tr”! € Rec(S,X). 

Proof. By Lemma fl.l.lll there exists an F-transducer 21 with dom(r 2 i) = range(r 2 t) = T 
and T% is the identity mapping on T. Moreover, by the proof of Lemma 14.1.111 we may 
suppose that 21 is deterministic. Furthermore, by Theorem 14.3.91 F oDT = T. Thus, 
since Tt~^ = dom(r o r^), in order to prove Lemma 14.3.161 it is enough to show that 
the domain of an F-transformation is recognizable. But this is true by (i) of Theorem 
14.1.101 □ 

From Theorem 14.1.101 and Lemma 14.3.161 using the inclusion IZ C (see Theorem 
14.3.1311 . we get 

Corollary 4.3.17 Let r C Fj^(X) x Fq{Y) be an arbitrary R-transformation. If T £ 
Rec(r2,y), then Tt~^ £ Rec(S,X). In particular, dom(r) £ Rec(S,A). □ 

4.4 TREE TRANSDUCERS WITH REGULAR LOOK-AHEAD 

Consider an F-transducer 21 = (E, X, A,Ll,Y, P, A'). Take a tree p = a{pi,... ,pm) £ 
F^ {^X ) ((T £ PifYi, m ^ 0) and a derivation u(pi,..., Pm) ^(^i 1?i; ■ ■ ■ ) (^mQm) (Oj £ 

A,qi £ FQ(Y),pi =^* aiqiO = l,...,m). Then, knowing the states ai,...,am, our 
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transducer can decide which production a(ai,..., am) q to apply next. In other 
words, after inspecting the properties of the subtrees pi,... ,pm, the F-transducer 21 can 
select the production to be applied in the next step of the translation of p. Moreover, 
these properties of subtrees are regular in the sense that dom(r 2 i(ai)) is a regular forest 
for each i{= 1,... ,m). Obviously, R-transducers lack this possibility. This observation 
leads to the idea to provide R-transducers with regular look-ahead as follows. 

Definition 4.4.1 A root-to-frontier tree transducer with regular look-ahead (Rn-trans- 
ducer) is a system 21 = (S, A, A, fl, T, P, A'), where 

(1) S, A, A, and A' have the same meanings as in Definition 14.1.41 

(2) P is a finite set of productions (or rewriting rules) of the form (p —)■ q,D), where 
p —>• q is an R-transducer production and D is a mapping of the set of all auxiliary 
variables occurring in p into Rec(I], A). 

If p is of the form ax (x € A) or aa with cr € Sq, then the domain of D is empty. We 
write such rules generally as ax ^ q and aa — q, respectively. Moreover, for any a € A, 
we put 21(a) = (S, A, A, D, Y, P, a). 

Definition 4.4.2 Let 21 be the Ru-transducer of Definition 14.4.11 21 is called determin¬ 
istic if the following conditions are satisfied: 

(i) A' is a singleton. 

(ii) If (pi —)■ qi,Di) and (p 2 —^ 92 ) ^> 2 ) are two productions in P with pi = P 2 , and 
qi ¥" 92 ) then there exists an i (1 < z < m) such that Di(^i) n 02 (^ 1 ) = 0) where 
m is the number of auxiliary variables in pi{= P 2 )- 

Linear and nondeleting RR-transducers are defined in the same way as their R- 
transducer counterparts. 

Definition 4.4.3 Take an R^-transducer 21 = (S, A, A, D, T, P, A'), and let p,q G 
Po[y U APs(A)] be two trees. It is said that p directly derives g in 21 (in notation, 
p =^ 2 i q) if q can be obtained from p 

(i) by replacing an occurrence of an ax {a (z A, x (z A) in p by the right side g of a 
production ax —)■ g in P, or 

(ii) by replacing an occurrence of a subtree aa{pi,... ,pm) {a € A, a £ 'Lm-,'m > 
0,pi,.--,Pm G -^s(A)) in p by q{pi,. ■ ■ ,Pm), where (aa q,D) is in P and 
Pi G D(^i) for each i(= 1,..., m). 

A sequence 

P = Po ^21 Pi ^2t • • • ^21 Pfc = 9 (^ > 0) 

obtained by consecutive applications of direct derivations is a derivation of g from p in 
21. When such a derivation exists, we write p g. Again, this notation will also be 
used to indicate a certain derivation. 

If there is no danger of confusion, then we generally omit 21 in =>21 and . 
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According to Definition 14.4.31 the difference between derivations in R-transducers and 
RR-transducers is that in case of an RR-transducer 21 a production aa ^ q can be 
applied to a tree aa{pi,... ,Pm) if and only if there is a production {aa —)• q,D) of 21 
such that each subtree pt (1 < * < m) is in the recognizable forest D{^i). 

Definition 4.4.4 Let 21 = (S, X, A, D, R, P, A') be an RR-transducer. Then the relation 

La = {{pm) I P € F^{X),q € Fq{Y), ap q for some a G A'} 

is called the transformation induced by 21. 

A relation r is an transformation if there exists an RR-transducer 21 such that 
r = ra- 

Linear, nondeleting and deterministic Rji-transformations are defined in an obvious 
way. 

The class of all RR-transformations will be denoted by IZr. 

Let us note that there exists a recursive definition of transformations induced by RR- 
transducers. This can be obtained by an obvious modification of the corresponding 
definition of transformations induced by R-transducers. 

Moreover, for RR-transducers the notion of a reordering of direct derivations can be 
defined in the same way as in the case of R-transducers. Furthermore, the remarks 
concerning different forms of derivations in R-transducers are valid for RR-transducers, 
too. 

To illustrate the concepts of RR-transducers and RR-transformations, consider 

Example 4.4.5 Let X = {x} and S = Si U S 2 , where Sj = {cjj} {i = 1,2). Take the 
forests Ti = {cri(x)}** and T 2 = {(Ti(x)}. Let 21 = (S, X, {oq, ui}, D, R, P, oq) be the 
RR-transducer where D = = {w}, R = {y} and P consists of the productions 

(aoo'2 —t w(ai^i). Pi) (Pi(^i) = Ti, Pi(^2) = T 2 ), 

(aiCTi —w(ai^i), P2) (P2(.^i) = Ti), 
aix —)• y. 

Then = {{a2{(7i{x),ai{x)),uj'^~^^{y)) \ n = 0,1,...}. Observe that (without 
regular look-ahead) the corresponding R-transducer would induce the transformation 
{{(T 2 {(T^{x),p),uj"'+^{y)) |p€Ps(X),n = 0,1,...}. □ 

Obviously R-transducers are special cases of RR-transducers. On the other hand, RR- 
transducers can restrict the domain of possible subtrees of input trees even if these are 
deleted. In fact, no R-transducer could induce the rgt considered in the above example. 
Assume that such an R-transducer 

2S = (S,X,P,D,R,P',P') 

exists. Then for every n(> 0), the production applied first in a derivation boa 2 {cri{x), 
<Ti(x)) uj'^~^^{y) {bo € B') should be of the form 
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(i) 6 o<T 2 q{b^i) or 

(ii) boa 2 q{b^ 2 ) {b£ B,q = m > 0). 

Let k be the maximum of the heights of right sides of productions from P' and n > 3k. 
Then the considered production should be of the form (i). But in this case all pairs 
{a 2 {(Ti {x),p), {y)) {p € Fj^{X)) are in r®, which is a contradiction. □ 

Theorem 4.4.6 The following inclusions hold: 

(i) Fr C VPrel o TZ, 

(ii) CTZr C VPrel o CTZ, 

(iii) VTZr C VPrel o VTZ, 

(iv) CVTZr C VPrel o CVTZ. 

Proof. Let 2t = (S, X, A, A, Y, P, A') be an arbitrary RR-transducer. Let Ti,..., (C 
Ft.{X)) be all regular forests which appear as images in the D-mappings of the produc¬ 
tions in P. Denote by V the set of all A:-dimensional vectors with components 0 or 1. 
Now take a ranked alphabet D, where Dq = Sq, and for each m > 0,flm = x V^. 

Thus, the elements from (m > 0) can be given in the form (a, (vi,..., v^)), where 

(T G Em and vi,..., G R 

Let Aj = {Ai,ai, A'f) be SA-recognizers with Ai = (Aj,S) and T'(Aj) = Ti {i = 

1,..., /c). We introduce the F-transducer 53 = (E, X, B, kl, X, P', B') where B = B' = 

Ai X ... X Ak and P' consists of the following productions: 

(I) X —>■ (xoi,..., xak)x {x G A), 

(II) a (cr-^i, . . . , cr-^'')(T (cr G Eq), 

(III) u(ai ) ^ a(( 7 , (vi ) • • • ) Vm ))(Cl) • • • ) ?m) 

{a G Em,m > 0; a, a* G R, v* G R, i = 1,... ,m), 

where 

a = (fT-^i(aii,... ,ami), • •.,(or> • • • >amj) 

and Vi. = I iff Ui. G Aj. Obviously, 53 is a deterministic F-relabeling. 

One can easily show that 53 relabels every EA-tree p in the following way: 

(a) if p G A U Eq, then T®(p) = p, 

(/3) if p = o-(pi,... ,pm) (o- G Ym,ni > 0) then r^sip) = (a, (vi ,... ,Vm))iT^(pi), • • •, 
T^(Pm)), where = I iff p* G (1 < i < m,l < j < k). 

Next construct the R-transducer (t = {Q, X, A, A,Y, P", A') where P" consists of the 
productions below: 
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{a') ap ^ r {a ^ A,p ^ X U r € Fa{Y)) is in P" iff it is in P, 

{13') a{a,{vi,... ,Vrn)) r {a e A;a e Em,m > 0]Vi e V,i = I, - ■ ■ ,m-r £ Fa[Y U 
AEm]) is in P" iff (a, (vi,..., v^)) occurs in a tree T^sip) {p £ Fj]{X)) and P 
contains a production {aa —)■ r,D) such that vi- = 1 whenever -D(Ci) = Tj (1 < 
i < m,l Y j < k). 

In order to prove = r® ore it is enough to show that for arbitrary a £ A,p £ F^{X) 
and r £ Fa{Y) the equivalence 

ap r arts {p) ^ 

holds. This can be carried out by induction on hg(p). 

It is also easy to show that is deterministic (linear) if 21 is deterministic (linear). □ 

Theorem 14.4.61 liiil shows that DR^-transducers induce (partial) mappings. 

Next we show that TZr is closed under certain special F-transformations. 

Theorem 4.4.7 The following inclusions hold: 

(i) Pr o -CJ' C TZji^ 

(ii) DPn o DC.F C DTZji^ 

(iii) DTZfi o DCfJZ C 

(iv) VTZr oR C VIZr. 

Proof. Let 21 = {Ti,X,A,D,Y,P,A') be an RR-transducer, and take an LF- transducer 

= {0,Y,B,X,Z,P',B'). 

We want to treat cases (i) and (ii) together. Since the set of initial states 
of a DRR-transducer should be a singleton we shall use the LF-transducer IB = 
{I},Y,B,A,Z,P',bo) instead of *B, where B = B U bo {bo ^ B) and P' is obtained 
by enlarging P' by the following productions: il y ^ bq {y £ Y), is in P' and b £ B', 
then y —)■ b^q is in P . Similarly, if a{bi ,..., bm) bq {a £ Ym,m > 0) is in P' and 
b £ B' then the production ct(6i, ..., bm) boq is in P . It is obvious that 

Construct the RR-transducer C = (S, X,AxB, A, Z, P" ,A' x {6 o})) where P" is given 
as follows: 

(I) {a,b)p r {a £ A,b £ B,p £ X Li T,Q,r £ Fa{Z)) is in P" iff there exists a 
production op —>■ g in P such that q br holds. 

(II) Assume that the production {aa —>■ g(ai^”\ ..., amfDu), D) {a £ A] a £ Em, rn > 
0; a* £ A^^i = 1,..., m; ni + ... + = n,q £ Ffi{Y U H„)) is in P and that there 

is a derivation g(bi^i,..., br{^^,... ,^^) with b £ B]hi £ E 

= Ui+...+ni_i+j, 1 < j < ni,i = 1,... ,m and r £ Fa{Z U H„). Then P" 
contains the production {{a, b)a —r(aibi^”\ ..., ambmCm""), D'), where D'{^i) = 
n(Tj((|j.,)(dom(r 23 (b,^))) | j = l,...,ni) O D{^i) {i = l,...,m). If 5 E P', then 

{{a,bo)a r(aibi^”\ ... ,amhm^^), D') is also in P". 
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By Corollary 14.3.171 the domain of an R-transformation is regular. Moreover, also 
by Corollary 14.3.171 the inverse of an R-transformation preserves regularity. Thus, by 
Corollary 14.2.91 and Theorems 14.4.61 and 12.4.21 D'{^i) {1 < i < m) is regular. 

In order to show raor^ = it is enough to prove that for all (a, 6) € Ax B,p £ F^{X) 
and r £ F/\{Z) the equivalence 

{a,b)p^*^r {3q £ Fn{Y)){ap Aq br) 

holds. This can be done by induction on hg(p). 

One can easily check that if 21 and 25 are deterministic, then so is (t. Thus, (i) and (ii) 
are valid. 

For (iii), take a DR^-transducer 21 = (S, X, A, fl, Y, P, oq) and a DLR-transducer 25 = 
{n,Y,B,A,Z,P',bo). 

Consider the RR-transducer (t = (S, X, yl x B,Q,Y, P", (oq, bo)), where P" is given in 
the following way: 

(I) If ap ^ q {a £ A,p £ X U Sq) Q £ Fq{Y)) is in P and bq r {b £ B,r £ F^{Z)) 
holds, then (a, b)p —>■ r is in P". 

(II) Suppose that {aa -A q{a.i^'^^ ,■■■■, D) (a £ A, a £ Sm, "i > 0, a* G A^\i = 

1,..., m, ni + ... + rim = n,q £ Fq{Y U H„)) is in P and there is a derivation 
bq r(bi^i,...,bm^^) with b £ B,hi £ 1 < 

j < ni,i = 1,... ,m and r £ F^(Z U H„). Then the production 

((a, b) -A r(aibi^”i,...,a^bmC")^-^^0 

is in P", where for every i(= 1, ..., m), 

D'iii) = P|(dom(r 2 t(ai.)) | iij (1 < j < rii) does not occur in r) D D{^i). 

Obviously, is a DR^-transducer. Moreover, for a\\ a £ A,b £ B,p £ Fy;{X) and 
r £ F/\{Z) the equivalence 

(a, b)p r {3q £ Ffi{Y)){ap =^^qAbq r) 

holds. This can be proved by induction on hg(p). Therefore, = raor®. Thus we have 
shown that VPr o VCR. C VTZr. 

To show (iv), let 21 = {Ti,X,A,fl,Y,P,ao) be a DR^-transducer and 25 = 
(n, Y, {6o}, A, Z, P', bo) an HF-transducer. 

Construct an RR-transducer = (S, X, A, A, Z, P",ao), where P” is given as follows: 

(I) ap ^ r {a £ A,p £ YoU X,r £ F/\{Z)) is in P" iff there is a production ap ^ q in 
P such that q bor holds. 

(II) Suppose that the production {aa —)> q{ai^^^,... ,am^^),D) {a £ A,a £ Ym,m > 
0, aj £ = 1,..., m, ni + ... + nm = n,q £ Fq{Y U H„)) is in P and there is a 
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derivation • ■ ■, bor{^^l ^,..., ,..., ,..., 

^ ^ = Cni+...+ni_i+j > 1 ^ J ^ j * = 1 > ■ ■ ■ j ^11 + ■ ■ ■ + ^Ini + 

... + = k,r € F^{Z U Hfc). Then the production 


mnm ' 


where 
■ + kml + 


{aa , • • •, 4^/ > ■ ■ ■ > am" 

is in P", where for every i(= 1,... 
but it does not occur in r) D D{^i). 


C^ml f^kmnm C^rnnm\ F)^ 

Sm ? • • • ? ^rrinm 


n(doni(T 2 i(ai^)) I occurs in q 


Using a similar argument as in the proof of (ii), we get that D'{^i) is a regular forest. 
It is obvious that d is deterministic. 

Finally, to show o r® = T(r it is enough to prove that for all a G G Fs(X) and 
r G F/\{Z) the equivalence 


ap r pT 2 i(a) bor 

holds. This can be done by induction on hg(p). □ 

From Theorem 14.4.71 we get 
Corollary 4.4.8 The inclusions 

(i) TZr o J'rel C 7^i^, 

(ii) VTZr o DFrel C VIZr, and 

(iii) VIZr o PT^rel C VTZr 

hold. □ 

Next we show that the classes of LF-transformations and LR^-transformations coin¬ 
cide. 


Theorem 4.4.9 CIZr = CF. 

Proof. Since VFxA C CAfF, the inclusion CTZr C CF is implied by Theorems 14.4.61 
Ciii. iro^ and imu fiii. 

In order to prove CF C CIZr, take an LF-transducer 21 = (C, X, A,Q,,Y, P, A'). Con¬ 
sider the RR-transducer 23 = {C,X,A,Il,Y,P',A'), where P' is given as follows: 

(i) If X —>■ ag (x G X, a G A, g G Fq{Y)) is in P, then ax ^ q is in P'. 

(ii) If fT(ai,... ,am) aq {a £ Cm,m > 0,ai,... ,am,a £ A,q £ Fq{Y UH^)) is in P, 
then {aa -£■ q{ai^i, ..., amim)-, D) is in P', where for every i{= 1,... , m), 

n(£ ) = I if does not occur in q, 

* \ Fs(X) otherwise. 
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Obviously, is an LR^-transducer. To prove = r® it is enough to show that for 
each a £ A,p & Fj:;{X) and q £ Fq{Y) the equivalence 

p aq ap q 

holds. Again, we omit the straightforward inductive proof. □ 

In the proof of the above theorem we used look-ahead to ensure that the LRr- 
transducer will not transform any tree which contains a subtree for which the LF- 
transducer has no transform but which it would later delete. 

From Theorem 14.4.91 by Theorem 14.3.61 (i), we get 

Corollary 4.4.10 CTZr is closed under composition. □ 

Next we show that TZr is closed under LR^-transformations and 'DTZn is closed under 
composition. 

Theorem 4.4.11 The following equations hold: 

(i) = 'TIr, 

(ii) VIZr o VTIr = VTZr. 

Proof. TZji o CTZr = TZr follows from Theorem 14.4.71 bv Theorem 14.4.91 
Since, for each S and X, the identity mapping on Fj]{X) is in VTZr, in order to prove 

(ii) it is enough to show the validity of the inclusion VTZr o VTZr C VTZr. 

By Theorem 14.4.61 (iii), the inclusion VTZr o VTZr C VTZr o VFrel o VTZ holds from 
which, using Corollary 14.4.81 fhi. we get VTZroVTZr C VTZroVTZ. This latter inclusion, 
by the proof of Lemma 14.3.101 implies VTZr o VTZr C VTZr oTi o VCTZ. Now, using 
Theorem 14.4.71 (iv), we get VTZr o VTZr C VTZr o VCTZ, from which by Theorem 14.4.71 

(iii) , we arrive at the desired inclusion VTZr o VTZr C VTZr. □ 

To end this section we prove the analogue of Theorem 14.3.121 
Theorem 4.4.12 TZr = 77 o CTZr. 

Proof. The inclusion Ti o CTZr C TZr directly follows from Theorem 14.4.111 (i). To show 
TZr C TL o CTZr, consider an R^-transducer 21 = {'C,X, A, X, Z,P, A'). Omit regular 
look-ahead in 21 and for the resulting R-transducer consider the H-transducer 2S and 
LR-transducer given in the proof of Lemma I4.3.1UI Now it is impossible to provide € 
with a suitable regular look-ahead in an obvious way since H-transducers do not preserve 
regularity. We shall solve this problem in the following way. 

Take the tree homomorphism h : F^{X) —>■ Fj^{X) given as follows: 

(i) hx{x) = X {x £ X), 

(ii) hmk{cr') = a{(i,(k+i, • • •, (o' € m > 0). 
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One can easily verify that for every p € Fs(X) the equality h{T<^(j))) = p holds, i.e., 
h{p') = p (for p', see the proof of Lemma [4. 3. 101) . 

Now replacing each production aa' r(ai^^,...,^ > 0, (au — 

?’(aiCr\- • • € P) in P” by {aa' r(ai^i,..., L>'), where D'{ii.) = 

h~^{D{I^i)) (i = 1,..., m, j = 1,..., /c), from Gl we get an LRR-transducer since, by The¬ 
orem [THISl preserves recognizability. Let us denote the resulting LR^-transducer 
also by Gl. 

Using tree induction, it is easy to prove that t% = t^o r^. □ 

4.5 GENERALIZED SYNTAX DIRECTED TRANSLATORS 

In studying certain properties of tree transformations it is technically useful to consider 
systems that translate trees into strings. Such systems are also of interest as mathemat¬ 
ical models of syntax directed translations of context-free languages. 

Definition 4.5.1 A generalized syntax directed translator (GSDT) is a system 21 = 
(S, X, A, y, P, A'), where 

(1) S is a ranked alphabet, 

(2) A is a unary ranked alphabet (the state set), 

(3) X and Y are alphabets, 

(4) A' C A is the set of initial states, and 

(5) P is a finite set of productions (or rewriting rules) of the following two types: 

(i) ax ^ w {a G A,x G X,w R*), 

(ii) aa ^ w {a G A, a G Em,rn > 0, rc € (R U AEm)*)- (Here AEm is treated as 
an alphabet; the elements of it are the trees of the form a^i with a G A and 
Ci € ^m-) 

For ap ^ w we shall use the notation (ap,w), too. Moreover, for any a € A, we put 
21(a) = {Y,X,A,Y,P,a). 

Next we define translations induced by a GSDT 21. To this end, we associate with 
each a G A and p G F^{X) a subset px'^^a as follows: 

(i) if p € (X U So), then = {w \ (ap, w) G P}; 

(ii) if p = a{pi,... ,Pm) (<7 G Tjm,m > 0), then for all 

(acr, wiaii^i)Ci^W 2 ... Wkai(^k)Cik'>^k+i) € P 

(1 < ij < m,j = Wk+i G R*) and u*. G (j = l,...,k) 

the word wiVi^W 2 ■ ■ ■ WkVi^Wk+i is in pr^i^a, and 

(iii) nothing is in any unless this follows from (i) and (ii). 
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Definition 4.5.2 Let 21 = {T,, X, A,Y, P, A') be a GSDT. Then the translation induced 
by 21 is the relation = {{p, w) \ p G Fj]{X),w € Y*,w G pT^,a for some a G A'}. 

The class of all translations induced by GSDTs will be denoted by G. 

For translations induced by GSDTs we give another definition showing how a transla¬ 
tion is carried out step by step. 

Let 21 be the GSDT of Definition 14.5.11 Take two words v,w G {Y LS AFj^{X U H))*. 
(Here again each element of AFy.{X U H) is considered a symbol, i.e., we ignore the fact 
that these elements are composed of simpler objects.) We say that v directly derives w 
in 21 , and write v =>21 w, A w can be obtained from v by 

(i) replacing an occurrence of ax {a G A, x G X) in v by the right side u; of a production 
ax from P, or 

(ii) replacing an occurrence of an aa{pi,... ,Pm) {a G A, a G rn > 0,pi,... ,Pm G 
Ps(X U H)) in V by wiai(^i')Pij^W 2 ■ ■ ■ Wkai(^k)Pik'>^k+i where 

acr wiai(^i)ChW 2 ■ ■■Wkaii^k)Cik'>^k+i {1 < ij < m,j = 1,... ,k,wi,.. .,Wk+i G Y*) 
is a production in P. 

Each application of a step (i) or (ii) is called a direct derivation in 21. A sequence 

V = vq =^2l Vk = w {k > 0,Vi G (Y L) AFy,{X U E))*, z = 0,..., fc) 

of consecutive direct derivations is a derivation of w from v in 21 , and k is the length of 
this derivation, ff w can be obtained from u by a derivation in 21 , then we write v w. 
Thus is the reflexive-transitive closure of =^ 21 - Again, we suppose that the notation 
V w implicitly includes a given derivation of w from v. 

Using the notation =>|^, the translation T 21 induced by a GSDT 21 = (S, X, A, Y, P, A') 
can be given by 

Ta = {{p,w) I P € Fy.{X),w G Y*,ap w for some a G A'}. 

The concept of a reordering of direct derivations in GSDTs can be defined in a similar 
way as in the case of an R-transducer. Moreover, different forms of derivations can be 
introduced in an obvious manner. 

Deterministic, linear, totally defined and nondeleting GSDTs are defined in a natural 
way. Moreover, a one-state totally defined deterministic GSDT is a GSDH-translator. 
The translation induced by a GSDH-translator is called a generalized syntax directed 
homomorphism (GSD homomorphism). The class of all GSD homomorphisms will be 
denoted by ^hom- 

Example 4.5.3 Let 25 = (S, {x}, {ftoj ^ 2 }, {yi, 2 / 2 }) E', 60 ) be a GSDT, where S = 
Si = {cj} and P' consists of the productions 

bocr bi^ib2Ci, 
her bi^i, 62 CJ 626 , 

bix yi, b 2 X 2 / 2 - 
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Then is deterministic, totally defined and nondeleting, but it is not linear. 

Take the treep = a{a{a{x))) and the word w = yiy 2 - Moreover, consider the derivation 

bia{a{x))b2(T{a{x)) bia{x)b2cr{a{x)) bixb2(T{a{x)) =^>3 
yib2cr{a{x)) =^<b yib2cr{x) =^<b yib2X yiy2 = w, 

i.e., T 5 ((p) = yd(T 2 i(p)), where 21 is the R-transducer of Example 14.1.61 One can easily 
show that the previous equality holds for every p G Fs({x}). □ 

The above relation generally holds between GSDTs and R-transducers as it is shown 
by 

Theorem 4.5.4 For each GSDT 21 = {T,,X,A,Y,P,A') there exist a ranked alphabet D 
and an R-transducer IB = (S, X, A, bl, Y, P', A') such that = {{p, yd((?)) | (p, q) G r®}. 
Moreover, if 21 is linear, deterministic, nondeleting or a GSDR-transducer, then OS 
can also be chosen, correspondingly, as a linear, deterministic, nondeleting or an RH- 
transducer. 

Conversely, for every R-transducer IB there exists a GSDT 21 such that {(p,yd(g)) | 
(p, g) G r^s} = ra. 4/25 is, respectively linear, deterministic, nondeleting or an RH- 
transducer, then 21 is linear, deterministic, nondeleting or a GSDR-translator. 

Proof. Let 21 = (S, X, A, Y, P, A') be a GSDT. To define 25, for each production ap ^ w 
{a G A,p G X U S, tc G (y U ^E)*) in P, let 0J[ap,w) be an operator with rank |t(;|. Let 
D be the resulting ranked alphabet. Moreover, P' is defined as follows: 

(i) If ap —)■ tc (a G A,p G X U So,rc G Y*) is in P and |rc| = k, then the production 
ap ^{ap,w){qi, ■■■Nk) (g* G = 1,... ,4) with yd(a;(ap^^)(gi,.. .,qk)) = u; is in 
P'. 

(ii) If OCT —>■ tc (a G M, (T G Ym,m > 0, re G (T U AEm)*) is in P with |rt;| = k, then 
the production aa —>■ oj(^aa,w){qii ■ ■ ■ ■,qk) {qi £ ^ C AEmP = 1,...,A:) satisfying 
yd(w(ao-,io)(9i) ■ ■ •, qk)) = rc is in P', where yd is taken over the frontier alphabet 
y U AEm. 

In order to prove pa = {(p, yd(g')) | (p, g) G r®} it is enough to show that, for all 
a G A,p £ Fy:{X) and w GY*, the equivalence 

opiate (3^ G Pn(y))(ap 4 A yd(gr) = tc) 

holds. This can be done in an obvious way by induction on hg(p). 

It is also obvious from the construction of 25 that the remaining conclusions of the 
first part of Theorem 14.5.41 hold, too. 

Conversely, consider an R-transducer 25 = {Y, X, B,D,Y, P', B'). The productions of 
the desired GSDT 21 = (S, X, B, Y, P, B') are given as follows: 

(I) For all b G B,p G X U Sq and q G Fq{Y), if bp ^ q is in P', then bp yd(g') is in 
P. 
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(II) For all b £ B,a ^ T^m {m > 0) and q G Fq{Y U BEm), if ba ^ q is in P', then 
ba —>■ yd(g') is in P, where yd is again taken over the alphabet Y U BE^- 

To prove = {{p,yd{q)) \ {p,q) G r^s} it is enough to show that the equivalence 

bp^^w {3q £ Fn{Y)){bp q Ayd{q) = w) 

holds for arbitrary b £ B,p £ Fj]{X) and w £Y*. This can be carried out by induction 
on hg(p). Moreover, the remaining conclusions of the second part of Theorem 14.5.41 are 
obviously valid. □ 

4.6 SURFACE FORESTS 

The images of regular forests under tree transformations are called surface forests. In 
this section we compare classes of surface forests belonging to different classes of tree 
transformations. 

Definition 4.6.1 Let /C be a class of tree transformations. A forest S C Fq{Y) is 
called a fC-surface forest if there exist a ranked alphabet S, a frontier alphabet X, a 
forest R £ Rec(S, A), and a /C-transformation r C Fs(A) x Fq{Y) such that S = Rt. 
The class of all /C-surface forests is denoted by Surf(/C). 

The following lemma is obvious. 

Lemma 4.6.2 If fC is a class of tree transformations which contains all identity trans¬ 
formations, then Rec is included as a subclass in Surf(/C). □ 

Of course, this lemma applies to all of the classes of tree transformations which we 
have considered {F, IZ, CF,R etc.). 

Next we characterize F-transformations preserving regularity. For this we should in¬ 
troduce some more terminology. 

Definition 4.6.3 A tree transformation r C Fs(A) x Fq{Y) is said to preserve regu¬ 
larity if Rt G Rec(0,y) whenever R £ Rec(S, A). Moreover, a class /C of tree transfor¬ 
mations preserves regularity if every t in fC preserves regularity. 

We say that an F-transducer 21 = {Ti,X,A,fl,Y,P,A') is connected if for each a £ A 
there are p £ Fs(-^) and q £ Fq{Y) such that p =>* aq holds. 

Definition 4.6.4 For eachp G Fj^{XL)En), pathj(p) (1 < f < n) is given in the following 
way: 

(i) if p G So U A, then pathj(p) = 0, 

(ii) if p = Ci, then pathj(p) = {e}, 

(iii) if p = (j 7 ^ i), then pathj(p) = 0, 
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(iv) if p = a{pi,... ,pm) (o- € > 0 ), then 

pathj(p) = {jwj I Wj G pathj(pj), j = 1 ,... ,m}. 

Thus, pathj(p) is a language over the alphabet {1,... ,m,}, where m is the maximal 
integer with 7 ^ 0 - 

Obviously, the elements of pathj(p) describe paths leading from the root of p to a leaf 
labelled by 

If pathj(p) consists of a single word, then Z(pathj(p)) denotes the length of this word. 
Lemma 4.6.5 LT preserves regularity. 

Proof. Since the F-transducer given in the proof of Lemma Id.l.lll is linear, by Theorem 
14.3.61 (i), it is enough to show that for each LF-transducer 21 = (S, X, A, T, P, j 4'), 
range(r 2 t) is regular. Without loss of generality, we may assume that 21 is connected. 
Consider the regular flT-grammar G = {A,0,Y, P', A'), where P' is given as follows: 

(i) E X ^ aq [x € X, a £ A, q £ Fq{Y)^ is in P, then a —>■ g is in P', 

(ii) if cr(ai,... ,am) aq [a £ Em,m > 0,ai,... ,am,a £ A,q £ Pq(TUH^)) is in P, 

then a q{ai ,..., am) is in P'. 

In order to prove the lemma it is enough to show that the equivalence 

(3p € Fs(^))(p ^21 «4) ( 1 ) 

holds for all a G A and q £ Fn(Y). 

(I) First we prove that the left side of ([T]) implies its right side. For this, assume that 
a q is valid. We shall proceed by induction on the length I of a =^q q. 

Let I = 1. Then a —)• g is in P', and the following two cases are possible: 

(la) There is a production x ^ aq [x £ X, a £ A, q £ Fq{Y)') . 

(l b) There is a production cj(oi, ..., Um) aq {a £ Ym, m > 0 , ai,..., am, a £ A) 

such that in q no auxiliary variables occur, i.e., q £ Fft(Y). 

In case (la) take p = x. 

In case (Ib), since 21 is connected, there are pi G Ps(X) and qi G Pq(Y) (i = 
l,...,m) such that pi aiqi hold. Now taking p = a{pi,. ■ ■ ,Pm) we have 
p = a{pi,... ,pm) =>a • • •, am4m) =^ 2 l aq{qi, ...,qm) = aq. 

Next, assume that I > 1 and that our statement has been proved for derivations of 
length less than 1. Then a q can be written in the form a ■ ■ ■ ,o,m) 

q{qi ,..., qm) = Q, where cr{ai ,..., am) —> aiq is in P for some a G Ym {m > 0) and 
O'i =^G 4* (1 ^ * ^ ^) if occurs in q. By the induction hypothesis, for all such i 
there exists a pi £ Fj:{X) with pi atqi. In the remaining cases, i.e., if does 
not occur in q, let pi £ Ps(X) and qi £ Fn{Y) (1 < z < m) be arbitrary such that 
Pi Then p = a{pi,... ,pm) satisfies p aq. 
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(II) Assume that p aq holds. We shall show by induction on hg(p) that the left 
side of m is also valid. If hg(p) = 0, then, by the choice of P', the right side of 
(fTj) obviously implies its left side. 

Now let p = a{pi,... ,pm) {o' G > 0), and assume that our statement 

has been proved for all trees from Fj]{X) with height less than hg(p). Moreover, 
let us write p aq in the form p a{aiqi,... ,amqm) aq{qi,... ,qm), 
where cr(ai,..., Um) aq is in P and pi aiqi {i = 1,..., m). Then, by the 
definition of P' and the induction hypothesis, we have a =^g Q{ai, ■ ■ ■ ,am) 
q{qi,...,qm) = q- □ 


From Lemma 14.6.51 using Theorems 14.2.71 and 14.4.91 respectively, we get the following 
results. 

Corollary 4.6.6 CP preserves regularity. □ 

Corollary 4.6.7 CPn preserves regularity. □ 

A state o G A of an F-transducer 21 = (S, X, A, fl, Y, P, A') is nondeleting if there exist 
two trees p € Fj:{X Li Hi) and q G Fq{Y U Hi) such that p(a^i) a'q for some a' G A' 
and occurs in q. Otherwise a is deleting. The state a is copying if there are two trees 
p G Fy.{X U Hi) and q G Fn{Y U Hi) such that p{a^i) =>* a'q for some a' G A' and |i 
occurs at least twice in q. 

Lemma 4.6.8 Let 21 = {'C,X,A,fl,Y,P,A') be a connected F-transducer. If pre¬ 
serves regularity and a ^ A is copying, then range(r 2 [(a)) is finite. 

Proof. Assume that preserves regularity. Let a G A be a copying state, and take two 
trees p G Fy:{X U Hi) and q G Fq{Y U H„) such that p{afi) =>* a'q{fi) where a' G A' 
and n > 1. Suppose that range(r 2 t(a)) is infinite. Then there is an s G range(r 2 i(a)) with 
hg(s) > fc • |A|, where k is the maximum of the heights of the right-hand sides of the 
productions in P. Let r £ Fy,{X) he a tree such that r =^* as. Since hg(s) > k ■ |A|, 
there are trees ri,r 2 G Fj:{X U Hi) and r^ G Fj^{X) such that the following conditions 
are satisfied; 

(i) ri{r2{r3)) = r, 

(ii) rs =^* bss, r 2 {bfi) =^* bs 2 and ri(6^i) asi for some b £ A, si, S 2 £ Fq{Y U Hi) 
and S 3 G Fn{Y), 

(hi) hg(s 2 ) > 0, and ^i occurs in si and S 2 , 

(iv) S 1 (S 2 (S 3 )) = s. 
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Therefore, for each i{= 0,1,...), there is a derivation pi = p(ri(r 2 (r 3 ))) a'q{t'^) = qi 

where = si (s2('S3)) (the powers T of any tree t G Ej](XUEi) are defined thus: 

and = t(T) for each i > 0). Obviously, hg(g'j) increases with i when i is large enough. 

Now consider the forest T = {pi \ i = 0,1,...}. Obviously, T is regular. Since 
preserves regularity, this implies that T' = Tt% is also regular. Take an OT-recognizer 
B = {B, n, Y, 13, B') with T' = T(B). Choose an 

i > [2k{hg{p{r)) + 1) + 2|B|)A:(hg(p(r)) + l). 

Then there exists a tree t G Fq{Y) with k(h.g{p{r)) + l) + \B\ < hg(t) < A:(hg(p(r)) + 
l) + 2\B\ such that 

q = q{t,t7-^) (2) 

is also in T'. To prove the lemma it is enough to show that there exist no j and a" G A' 
such that pj a"q. Suppose 

, * n Iij'm\ u— 

Pj ^ a q [t ) = a q 

holds, where a" G A', q' G Fq,{Y U H^), rs r 2 {bi^i) {bi,bi+i G A, 

si+i G Fn{Y U Hi), I = s} G Fq{Y)), ri{bj+i^i) 6j+2s}+2 {bj+2 G A, 

Sj +2 € Fn(Y U “i)), p{bj+ 2 ^i) a"s }+3 = a"q' and t' = s}+ 2 K+i(- • • Ui) •••))• 

By the choice of i, there exists au {2 <u<j + 3) such that occurs in 
• • • 5 '5j'+ 3 but does not occur in s^_i. Moreover, let u — 1 < ui < ... < 
Uv < j + 3 he a maximal sequence with 1 < hg(s„j) < ... < hg(s„„), where si = 
Si{s'i_i{-■ ■ (s'l)...)) {I = 1,... ,j+3). Then u > 2A:(hg(p(r)) + l)+2|B|. Taking into con¬ 
sideration that hg(t) > A:(hg(p(r)) -|-1) -|- \B\ (and \B\ > 1), for anl {2 < I < v), the word 
w forming pathi(( 7 ) is a subword of a word in pathi(s'_|_ 3 (s'_|_ 2 (... {s'ui) ■ ■ •)))• (Infor¬ 
mally speaking, this means that there is a word in pathi (sf^ 3 (s'_,_ 2 (... {s'ui) ■ ■ •))) going 
through the root of t.) Therefore, we have /(pathi(g)) -|-hg(t) > 2A:(hg(p(r)) -|-1) -|-2|B|. 
But, by dll) and the choice of t, /(pathi(g)) -|- hg(t) < 2k{h.g{p{r)) + l) + 2|B|, which is 
a contradiction. □ 

Lemma 4.6.9 Let 21 = {Y,X,A,D,Y,P,A') he a connected F-transducer such that for 
every copying state a G A, range(r 2 i(a)) is finite. Then 21 is equivalent to a linear F- 
transducer. 

Proof. Suppose that oi,..., are all the copying states of 21. Let Tj = range(r 2 i(aj)) 
{i = 1,... ,k). Moreover, set T = U(^* | z = 1,..., A;). By our assumptions, T is finite. 
Define an F-transducer 25 = (S, X, B, LI, Y, P', B'), where 

B = {A - {oi I z = 1,..., /c}) U |J({ai} xTi\i = l,...,k) 

and 

B' = { A' OA' xT)r\B. 

Moreover, P' is given as follows: 
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(i) U p ^ aq {p € 'EqU X) is in P and a = Ui for some i {1 < i < k), then p —> (a, q)q 
is in P'. If a 0 {ai,..., ak}, then p ^ aq itself is in P'. 


(ii) Let 


it(6i , . . . , bjn) ^ 7 ••• ■> Cm) 


{a G Em, m > 0,6i,..., bm, a G A,q € Fq{Y U Em)) be in P. We distinguish the 
following cases: 

(iia) The state a is deleting. Fix any q G Ffi{Y L) Em) such that every occurs 
at most once in q. Then P contains every linear production a{ci,... ,Cm) —>■ 
aq{^i,... ,Cm) such that 


f {bj,qj){qj gTi) if bj is copying and bj = ai, 
\ bj otherwise. 


(iib) The state a is nondeleting but not copying. Then all productions 

cr{ci,.. ) ^ aq{r]i, ...,Pm) 
are in P' where for each j(= 1,..., m), 


Ci = 


{bj,qj) {qj GTi) if bj is copying and bj = ai, 


otherwise 


and 


_ J Cj if Cj occurs at most once in q, 

I 4j(.~ ^ 2 {cj)) otherwise. 

(Observe that if ^j occurs at least twice in q, then bj is copying.) 
(iic) The state a is copying. Then P' contains all productions 

fT(ci,...,Cm) {a,q)q 

where q = q{pi, ..., rjm) and for each j(= 1,..., m), 

ibj,qj) {qj GTi) if bj is copying and bj = ai, 


Ci = 


otherwise 


and 


J qj if ^j occurs in q, 

y any fixed tree from Fq{Y) otherwise. 


(Note that bj is copying if ^j occurs in q.) 

This ends the construction of P'. Obviously, 55 is an LF-transducer. 

We show that 21 is equivalent to 55. 

(I) Assume that p aq {p G Fs(X), q G Fn{Y),a G A) holds. We prove that 
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(la) p aq if a is nondeleting but not copying, 

(l b) p (a, q)q if a is copying, 

(l c) p aq for some q € F^{Y) if a is deleting. 

We shall proceed by induction on hg(p). If hg(p) = 0 then, by (i), (la), (Ib) and 
(Ic) obviously hold. 

Next let p = a{pi,... ,pm) (c & E,m,m > 0), and write p aq in the more 
detailed form 

a{pi, ...,pm)^^ o-{biq[,.. ■,bmqln) ^ 2 t aq'{q[, ...,q'^)=aq 

where (t(6i, ..., bm) aq' is in P and for each j (1 < j < m), pj ^jlj- Then, 
by the induction hypothesis, for all j(= 1,..., m), we have pj ajqj, where 

(la') Cj = bj and qj = g' if bj is nondeleting and not copying, 

(Ib') Cj = {bj, qj) and qj = g' if bj is copying, 

(Ic') Cj = bj and qj = qj for some g^ G Tb(y) if bj is deleting. 

Therefore; 

(la") If a is nondeleting but not copying, then the production 

a{ci , . . . , C 777 , ) ^ aq'{r]i, ...,r]m) 

is in P', were r]j (j = 1,..., m) is given by (hb). 

(Ib") If a is copying then the production 

(t(ci, ...,Cm)^ {a,q')q' 

with g' = g'(? 7 i,..., r]m) is in P', were rjj (j = 1,..., m) is given by (iic). 

(Ic") If a is deleting then the production 

fT(ci,... ,Cm) aq' 

given by (iia) is in P'. 

Thus, in all three cases the required derivations in 53 exist. 


(II) Assume that one of the following relations hold: 

(l la) p aq or 

(l lb) p (a,g)g 

where p G Fj]{X), q G FniY) and a e A. 

Then, by reversing the above computation, one can show that the desired deriva¬ 
tions 
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(lie) p aq if a is nondeleting, 

(Ild) p aq for some q € Ffi{Y) if a is deleting 

exist. Since the final states are nondeleting, this ends the proof of the lemma. □ 


We can now state and prove 

Theorem 4.6.10 Let 21 = {Ti,X,A,fl,Y,P,A') be an arbitrary F-transducer. Then 
preserves regularity iff^ is equivalent to an LF-transducer. 

Proof. If 21 is equivalent to an LF-transducer then, by Lemma 14.6.51 preserves 
regularity. 

Conversely, let ra preserve regularity. We may assume that 21 is connected. Then by 
Lemmas 14.6.81 and 14.6.91 21 is equivalent to an LF-transducer. □ 

From Example 12.4.151 we directly obtain 

Theorem 4.6.11 Neither F nor TZ preserves regularity. □ 

The following result shows that Surf(J^) C Surf(7^). More precisely, we have 

Theorem 4.6.12 Surf(J') = Surf(7^) and Surf (77) is a proper subclass o/Surf(7^). 

Proof. The first statement of Theorem 14.6.121 follows from Theorem 14.3.31 and Lemma 
11X51 

It is obvious that Surf(77) C Surf(7^). We show that the inclusion is proper. For this, 
consider Example 14.1.61 Moreover, let S = {a; 2 (w”( 7 /i),cuf (7/2)) | u = 0, 1 ,...}. If 7? 
denotes the regular forest {cr{x)} -x {a{x)}*^, then 7?ra = S. Therefore, S € Surf(7^). 

Assume that for an HR-transducer 23 = {A, Z, {bo},Q,Y, P',bo) and regular forest 
T C Fj\(Z), we have S = Tr®. Then 23 can be chosen linear since in the opposite case 
in Tr<g there is a tree with at least two occurrences of a subtree. Therefore, by Theorem 
12.4.161 S is regular. But one can show similarly as in Example 12.4.151 that S is not 
regular. □ 

Next we show some closure properties of surface forests which will be needed also in 
Section 14.71 

Theorem 4.6.13 Let S G Surf (7^) and let T be a recognizable forest. Then S CiT £ 
Smf{F). 

Proof. Let Ti F Fy,{X) X Fq{Y) be an F-transformation and S = Rti where R € 
Rec(E,X). Take an arbitrary regular forest T C F^{Y). Denote by T2 C Fq{Y) x i^o(T) 
the DF-transformation given in the proof of Lemma f4.1.1 II which corresponds to T. Then 
Rti o t2 = S DT. But, by Theorem 14.3.91 ti o t2 is an F-transformation. □ 

For R-surface forests we have a similar result. 
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4 TREE TRANSDUCERS AND TREE TRANSEORMATIONS 


Theorem 4.6.14 The intersection of an R-surface forest with a regular forest is again 
an R-surface forest. 

Proof. The proof is similar to that of the previous theorem, but now we shall use the fact 
that the transformation given in the proof of Lemma 14.1.1 II is an LNR-transformation. 
Moreover, by Theorem 14.8.151 the composition of an R-transformation by an LNR- 
transformation is again an R-transformation. □ 


By Theorem 14.3.71 DT is closed under composition. Therefore, Surf(PJ^) is closed 
under DF-transformations. Although DTZ is not closed under composition, we shall 
show that Surf (217^) is closed under DR-transformations. For this, we need 

Theorem 4.6.15 Let 21 = {T,X, A,D,,Y, P,ao) and ^ = {D,,Y, B, A, Z, P',bo) be any 
DR-transducers. Then there exists a DR-transducer = (S, X, C, A, Z, P",co) such that 
for every R C Fj^{X), ST(r = Rt^ o r®, where S = RCi dom(T 2 i o r>g). 


Proof. Let C = A x B and cq = (ao,6o)- We want to define P" in such a way that 

whenever ap q [a ^ A,p ^ FY,{X),q € Fq{Y)) and bq r [b ^ B,r G Fa{Z)) 

hold, then (a, b)p r. If p € Sq U X, then {ap, q) G P. If we put the production 

(a, b)p — >■ r in P", (t will have the desired property for these a, b,p, q and r. 

Now let p = cr(pi,... ,Pm) (o' € ^m-, m > 0) and suppose 

ap = aa{pi,... ,pm) q{... ,aijPi,...) «(•••> •• •) = 4, 

where {aa,q{... ,aijii,. ..)) e P {q e Fq{Y U for some n) and aijPi qij, i.e., 
the considered copy of pi is translated by 21 starting in state Ojj into qij. Furthermore, 
suppose that applying to q the transducer 23 starting in b, we get 


bq = bq{...,qij,...) r{... ,bijiqij,... ,bijkqij,...) r(..., ..., ...) =r 

{bq r,r G Fa{Z U En),bijiqij riji,l = l,...,k) 

(meaning that the given occurrence of qij in q has k translations by starting the 
translations in states biji ,..., bijk). Thus, if we have the production 

{a, 6)(t >■ r {..., {aij, ..., {o^ij, bijifjf^i, • • •) 

in P" and suppose that C has the required property for trees with height less than hg(p), 
then (a, b)p r also holds. Accordingly, the formal definition of P" reads as follows: 

(i) The production {a,b)x —>■ r {{a,b) G C, a: G A, r G Fa{Z)) is in P" if there is an 
ax ^ q in P such that bq r. 

(ii) If the production aa —)■ ..., (o G A, cr G m > 0, a* G A^^i = 

1,... ,m, ni + Hm = n,q e Fq{Y U H„)) is in P and 


bq 




’(bii^ 
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{hij € iij = irii+...+ni_i+j, i = 1,... ,m, j = 1,... ,ni, n'li + ... + = 

n',r € Fa{Z U H„/)) holds, then the production 


{a,b)a r((a”“bii,..., ..., ( 0 ^ 7 ^b^i, 

in in P", where ki = + ... + (i = 1,... ,m). 


) ’Jmrim 


)& 


Obviously, is a DR-transducer. Moreover, to prove the theorem it is enough to show 
that for arbitrary (a, 6) E C,p E Fj:{X),q E Fq{Y) and r E F/^{Z), ap q and 
bq r jointly imply (a, b)p r. This can be proved by induction on hg(p). □ 


Let us note that the C constructed above may delete certain subtrees of input trees so 
that dom(r£) becomes larger than dom(r 2 i or®). 

If R in Theorem 14.6.151 is regular then, by Corollary 14.3.171 and Theorem 12.4.21 S is 
also regular. Thus we have 

Corollary 4.6.16 Surf(P77) is closed under DP-transformations. □ 


4.7 AUXILIARY CONCEPTS AND RESULTS 

In Section 14.31 it has been shown that neither P nor TZ is closed under composition. 
In the next section we shall prove that compositions of re F-transformations or re R- 
transformations lead to proper hierarchies when re assumes the values 0 , 1 , 2 ,.... 

The purpose of this section is to introduce concepts and present results needed in 
Section 14.81 

Let iF be a class of forests and S a class of tree transformations. Then S{K) denotes 
the class {Tr | T E iL, r E 5}. Moreover, yd5(iL) will stand for {yd(r) | T E S{K)}. 

Definition 4.7.1 Let S be a ranked alphabet and X an alphabet. Let / be a mapping 
which associates with each d E S U X a nonvoid recognizable forest C 
where (d) is a ranked alphabet consisting of unary operational symbols only. It is also 
supposed that 11 (d) is disjoint with S. 

Now define the mapping / from the set of all SX-forests into the set of subsets of 
l^sun(X) (n = |J(ll(d) I d E S U X)) in the following way: 

(i) if p E So U X, then J{p) = {q{p) \ q E Tp}, 

(ii) if p = (j(pi,... ,pm) {cr E Sm,?re > 0,pi,... ,Pm E Ts(X)), then 

Up) = • ,9m)) I 9 e E 7(p*),i = l,... ,rre}, and 

(iii) if T C Fs(X), then /(T) = (J {l{p) I P E T). 
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The mapping / is called a regular insertion. 

In the sequel we shall write simply / for /. 

The above regular insertion can be interpreted as follows; / inserts directly below each 
node of a tree p G Fy,{X) a unary tree from the regular forest if the label of the node 
in question is d. The insertion of means that the given node is unchanged. The name 
“regular insertion” is more expressive if trees are given in Polish prefix form. In this 
case / inserts a word from T^ directly before an occurrence of d in the word p. 

Lemma 4.7.2 Rec is closed under regular insertion. 

Proof. Let T C Fy,{X) be a regular forest and / a regular insertion given by f{d) = T^ 
{d € T,{JX,Td C FqCEi)). Consider a regular tree grammar G = {N,E,X,P,ao) 
given in normal form such that T(G) = T. Moreover, for every T^ {d £ E Li X) let 
n, Si, Oq) be a regular tree grammar in normal form generating Td. For 
each d £ E L) X and a £ N consider the tree grammar Gf = (A^^, fl, Si, (oq, a)), 
where A"^ = N'^ x {«} and 

pd = a) -£ w((6^, a)) \ a^ -£ u:{h^) £ P^} U {(a'^, a) ^ 6 I ^ 6 € 

Obviously, T{G‘^) = Td holds for each d £ELi X and a £ N. 

Assume that the sets of nonterminal symbols of the grammar G'^ (d G S U X) are 
pairwise disjoint and also disjoint with N and N x (SUX). Construct the tree grammar 
G' = {N', Eun,X, P', ao), where N' = U(Xa |dGSUX,aGX)UXuXx(SUX) 
and P' is given as follows: 

P' = {a —)• (oq, a) |aGX, dGSUX} 

U IJ [P^ - {(a"*, a) ^ ^1 I G X"*} I a G AT, d G S U X) 

U {(0“^, a) -£ (a, d)\a^ ^ii£ P^, a"* G X'^, a G X, d G S U X} 

U {(a,cr) a{ai,... ,am) \ a a{ai, ... ,0^) £ P,a £ Em,m > 0,a,ai,...,G X} 
U {(ft, d) — y d I ft — y d £ P, a £ X, d G Sq U X 

From the construction of G' it is obvious that the following statements are valid: 

(ia) For any production a -£■ a{ai ,..., am) £ P {cr £ Em, m > 0) and tree q £T„ there 
exists a derivation in G' 

ft (ftg, ft) g((ft'", ft)) ^ q{{a, a)) g(cr(fti,..., ftm)) G X'^). 

(ib) For any production a ^ d £ P {d £ Eq L) X) and tree q £ Td there exists a 
derivation in G', a ^ (uq, a) g((ft'^, ft)) ^ q{{a, d)) ^ d {a^ £ X'^). 

Conversely, 

(ii) for any a £ N and p £ PEun(X) each derivation a =^q, p should have the form 
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(iia) fl (flg , fl) (^1 ((O]^ , o)) ^n((®ri) ®)) ^ Q'n((Q') <7)) ^ 

• • • ) Q-m)) ^ P 

for some a —)■ cj(ai,..., Um) € P,qn & o' G Ti^jm > 0 and Og, ■ ■ ■ ,a'^ G 
or the form 

(iib) a ^ {aQ,a) gi((af,a)) ^ ... ^ qn{iai,a)) qn{{a,d)) qn{d) for some 
a ^ d ^ P,qn ^ T,d ^ TjoLI X and Og,..., G iV*^. 

Properties (ia), (ib), and (ii) obviously imply that T{G') = /(T). □ 

Lemma 4.7.3 Let K be a class of forests closed under regular insertion. Then P{K) 
is also closed under regular insertion. 

Proof. Let R G AT be an arbitrary SX-forest and take an R-transducer 2t = 
(S, X, 17, y, P, A'). Set S = Rt'^. Moreover, for every d G S U X take a unary 
operator jfdj and let / be the regular insertion given by f{d) = {#d(?i)}*^L 
First 'vve shall show that if 51 is a regular insertion for which g{d) = {d G 

Huy), then g{S) G n{K). 

Construct the R-transducer 55 = (S U {ifd \ d € S U X}, X, P, H U {#}, Y, P', A') with 
B = AU C, where C = {p \ p G (U(sub(g) | q is the right-hand side of a rule in P) — 
S)}. Moreover, P' is the union of the following ten sets of productions: 

Pi ={a^d o,ifd 0.^1 I a G yl, d G S U X}, 

P 2 ={a#d ^ a;(gi^i,..., | od —)■ (7 is in P for some 

dGSUX, oGR, g = w(gi,..., qm),^ £ ^m, rn > 0}, 

P 3 ={a#d —^ 9^1 I ad —)■ (7 is in P for some d G 'P,q = a, a' G A}, 

P 4 ={a#d qfi I ad —)■ (7 is in P for some dGSUX, aG ^,(7 = ti;G Hg}, 

P 5 ={a#d —^ 9^1 I ad —)■ (7 is in P for some dGSUX, aG ^,(7 = yGy}, 

Pe ={qifd #( 96 ), Wd a;(gi^i,... , 9 ^ 6 ), Wd ^ q^i I 

q = uj{qi,...,qm),u} GLlm, m>0}, 

P? ~{0'f,iifd ^ 0.f,iffd ^ a^j.^ 1 , Ojfj^d ^ ^ffd ^ 

yifd yifd yii I 1 < ^ < r(P),r(P) is the maximum of ranks 

of the operators appearing in the left-hand sides of productions from P, 
a G A.jUJ G Hg, 7 / G y}, 

Ps ={afia -G af,i \ a G A, a G Ym, m > 0,1 < i < m}, 

Pg ={idd -G u) \oj G Hg, d G S U X} and 
-Pro ={yd -Gy\y gY, dGSU X}. 
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One can easily see that works as follows: assume that for some a (z A,p Fj]{X) 
and q G Fn{Y) a derivation ap q exists. Let q' be a tree obtained by inserting 

in q arbitrary trees from {#(^ 1 )}*^^ below symbols from 14 U y. Then for a p' ^ f{p)^ 
ap' q' holds. Conversely, if for some a € A,p G Fy;{X),p' G f{p) and q' G 
a derivation ap' q' holds then there is a g G Ffi{Y) such that q' G g{q) and ap q. 

Now, consider an arbitrary regular insertion h (into OT-trees). For each d G 14 U T, 
there is a regular tree grammar Gd = {Nd, 14(d), Hi, Pd, {udo}) such that h{d) = T{Gd)- 
We may assume that every Gd is in normal form. Since 14(d) is unary, this means that 
the productions of Gd are of the form ad —)• 0 Jd{a'^) or ad G F'd, tdd G 14(d)). 

Furthermore we may assume that the sets Nd are pairwise disjoint. Now construct the 
R-transducer 

(j: = (i4u{#},y,c,A,y,p",c'') 

with 

C = |J(4Vrf I d G 14uy), C'= {odj dG i4uy} 

and 

A = IJ (14(d) I d G 14uy)ul4 (Ai = |J (14(d) | d G 14uy) U 14i, A^ = (m ^ 1)). 
Furthermore, P" is given as follows: 

(I) ad# ujdia'd^i) iad,a'd ^ ^d,^d S 14(d), d G 14 U y) is in P" if ad ^d{a!d) is in 
Pd- 

(II) a^jUJ .. .,ad^^Cm) is in P" for oj G Om,rn > 0, di, ..., d^ G 14 U y 

and a^; G N^ if —>■ Ci is in P^. 

(Ill) For each y €Y and ay G Ny, ayp ^ y is in P" if ay —is in Py. 

Obviously, C is an R-relabeling. Therefore, by Theorem 14.3.151 r® o = r is an R- 
transformation. Moreover, by the constructions of 53 and C, it is clear that the equality 
h{S) = f{R)T holds. □ 

In the next section we shall need 

Theorem 4.7.4 Let r : X* Y* be a mapping induced by a deterministic gsm and 
S a ranked alphabet. Then there exist a ranked alphabet 14 and a DRji-transducer 53 = 
(S, X, R, 14, y, P', 60 ) such that the equality yd(T)T = yd(Tr®) holds for every T C 
Fj:{X). 


Proof. Consider the deterministic gsm A = {X,A,Y,ao,P,A') inducing r. We shall 
show the existence of a ranked alphabet 14 and that of a DRu-transducer 53 = 
(S, X, B, 14, y, P', bg) such that for any p G Fe(X), 

(i) yd(pra 3 ) = yd(p)r if yd(p) G dom(T), and 


176 




4.7 Auxiliary concepts and results 


(ii) p € dom(r( 8 ) implies yd{p) G dom(T). 

These obviously will imply the validity of Theorem 14.7.41 

For each 01,02 G A, let T(oi, 02 ) denote the set of all such trees p G Fs(X) that 
oiyd(p) wa 2 holds for some w gY*. By Lemma [1.7.41 and Theorem 13.3.21 every 
T(oi, 02 ) = yd“^ (L(oi, 02 )) is a regular forest. Now let B = {Ax A) [J { 60 } (60 0 tY) 
and n = S U {uax | o G A, x G X}, where r{ujax) equals the length of the word w 
obtained from the production ax —>■ wa' G P {a G A'). (The ranks of symbols from S 
are unchanged.) Moreover, P' is given as follows; 

(I) For arbitrary m > 0, o G and oi, 02 ,..., Om+i G A, P' contains the production 
( (oi, Om+l)o' y (t((oi , 02 )^ 1 , • • • , (Offi) Offi+l )C m ),D) where D{^i) = T{ai, Oj+i) {i = 

1 ,... ,m). 

(II) If (T G So and a G A, then the production (a, a)a —>■ cr is in P'. 

(III) For arbitrary x G X and (ai, 02 ) G Ax A, P' contains the production ( 01 , 02 )® —)■ q, 
where oi® =>a wo ,2 {w G Y*) and q G Fn{Y) is a fixed tree with yd(( 7 ) = w (such 
q exists by the definition of coaix)- 

(IV) For arbitrary m > 0,a G S^ and oi,... ,Om+i G A, if oi = oq and Om+i G A', 
then the production {boa —>■ (t((oi, 02 )^ 1 ,..., (om, am+i)Cm), T)) is in P', where 
D{^i) = T(oi, Oj+i) (i = 1 ,..., m). 

(V) For arbitrary ® G V, if oq® =^a wcli {w G Y*) and oi G A 'then the production 
box ^ is in P', where q G Fn{Y) is a fixed tree with yd(g) = w (again, by the 
definition of coaox, such q exists). 

(VI) If Oq G a' and a G Sq, then the production boa —>■ o is in P'. 

In order to prove Theorem 14.7.41 it is enough to show that for arbitrary ( 01 , 02 ) G 
A X A,p G Fj:{X) and q G F^{Y) the implication 

(oi, 02 )^ ^’^q aiyd{p) yd{q)a 2 

holds. This can be carried out by induction on hg(p). □ 

We shall now introduce some more concepts that will be needed in the next section. 

Let 21 = (S, X, A, fl, Y, P, A') be an R-transducer. Take a tree p G Fy,{X) and a node 
d of p. Denote by s the subtree of p at this node d. Consider a state o and a derivation 
a : ap q {q G Fq{Y)). Suppose exactly k copies of this occurrence of s are created 
during a and that these are translated into the trees ti,... ,tk (S Fq{Y)) starting the 
translations, respectively, in states ai,...,Ofc. In the next definition we distinguish a 
sequence of these states which will be called the state-sequence of a at d. 

Definition 4.7.5 Let 21 = (E, X, A, D, Y, P, A') be an R-transducer. Take a derivation 

a:ap^*q {a G A,p G Fy,{X), q G Fq{Y)). 
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Let 4 be a node of p and s the subtree at this node d. Replace the given occurrence of 
s in p by and denote by r the resulting tree. Write a in the form 

ap = ar{s) =^* q{as'^) =^* q{t), 

where q G Fq{Y U H„),a G A^,ar q{a^i ),as^ t and t G Ffi{Y)"'. Denote by 
ttidi qi (a* G A, 4j G S U X) the production applied first in the derivation OjS =>* L 
(i = 1,..., n). Then a = (oi,..., an) is the state-sequence and 

(ai4i gi,... ,a„4„ <?n) 

is the production-sequence of a at d. 

Often we shall speak about the state-sequence and production-sequence of a at a 
subtree s. In such cases the node to which the given occurrence of s belongs will be clear 
from the context. 

We now define state-sequences for derivations in GSDTs. 

Definition 4.7.6 Let 21 = {E^X,A,Y,P^A') be a GSDT. Take a derivation 

a:ap^*w {a e A,p e Fs{X),w e Y*). 

Let 4 be a node of p and s the subtree of p at 4. Replace the given occurrence of s in p 
by and denote by r the resulting tree. Write a in the form 

ap = ar{s) WiaiSW2 ■ ■ ■ WnanSWn+l WlViW2 ■ ■ ■ WnVnWn+l, 

where ar =^* wiai^iW 2 ■ ■ ■ WnanCiWn+i {wi G Y *= 1,..., n -|- 1, ai,..., G A) and 
aiS =>* Vi {vi G Y*,i = 1,..., n). Then a = (ai,..., an) is the state-sequence of a at 4. 

Like in the case of R-transducers, we shall also speak about the state-sequence of a at 
the subtree s. 

Definition 4.7.7 Let 21 be an R-transducer 2t = {E, X, A,Q,,Y, P, A') [a GSDT 21 = 
{'E,X,A,Y,P,A')]. Then a derivation a : ap =^* q {a € A,p G Fj^{X),q G Fq{Y)) 
[f3 : ap =^* w {a G A,p G Fy,{X),w G T*)] is k-copying if for every node 4 of p the 
length of the state sequence of a [j3\ at 4 is at most k. Moreover, 21 is k-copying if every 
derivation a : ap q {p G FY,{X),q G Fq{Y)) [fd : ap =^* w {p G Fy,{X),w G T*)] 
with a G A' is fc-copying. Finally, 21 is finite-copying if it is fc-copying for some k. 

We shall use the notation for the class of all transformations induced by fc-copying 
R-transducers. Similarly, denotes the class of all transformations induced by k- 
copying GSDT’s. Moreover, 77/ and Gf will stand for the classes of transformations 
induced by finite-copying R-transducers and finite copying GSDT’s, respectively. Gor- 
responding notations will be used for the classes T>77, VQ etc. 

The next result shows that R-transformational languages can be studied through gen¬ 
eralized syntax directed translations. 
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Theorem 4.7.8 For every k-copying GSDT 21 = {Ti,X,A,Y,P,A') there exist a ranked 
alphabet O and a k-eopying H-transdueer 25 = {Y, X, A,k},Y, P', A') such that = 
{(p,yd(g)) I {p,q) e r®}. 

Conversely, for every k-eopying K-transducer 25 there exists a k-copying GSDT 21 such 
that ra = {{p,yd{q)) \ {p,q) G r®}. 

Proof. The R-transducer and GSDT constructed in the proof of Theoreni l4.5.4l obvionslv 
have the required properties. □ 

The following theorem gives sufficient conditions under which TZk[K) = PTZk{K) holds 
for a given class K of forests. 

Theorem 4.7.9 Let K be a class of forests closed under relabeling and regular insertion. 
Take an R-transducer 21 = {Y, X, A,kl,Y, P, A'), an R € K and a positive integer k. 
Then 

5 = {g G Fq{Y) I there is a k-copying derivation ap q for some a G A' and p G R} 
is in VUkiK). 

Proof. Since K is closed under regular insertion, we may assume that A' is a singleton. 
Indeed, in the opposite case enlarge 4. by a new state oq, S by a new unary operational 
symbol a and P by all productions aocr —>■ afi (o G A'). Let 21 be the resulting R- 
transducer with initial state oq, and let R = f{R) , where / is a regular insertion given 
by f{d) = {o'(Ci)} (d G X U S). Then R G K and t^{R) = T 2 i(R). Furthermore, 
a derivation ap q {a G A',p G R,q G Fq{Y)) is fc-copying if the corresponding 
derivation aoa{p) =>|j- q is /c-copying, and conversely. Thus, we shall assume that A' = 
{oo}. Now we introduce the alphabet 

X = {((aix,gi),..., {atx,qt)) \ t < k,x G X,aiX ^ qi G P {i = 1,... ,t)} 

and the ranked alphabet A with 

Am = {{{aia,qi),..., {ata,qt)) \ t<k,a G Ym,aia qi G P {i = l,...,t)} 

(m = 0,1,...). Gonsider the R-transducer 25 = (S, X, {6o}) A, X, P', bo) where P' con¬ 
sists of the productions 

box {{aix, qi ),..., {atx, qt)) (x G X, ((aix, qi),..., (atx, qt)) G X) 

and 

boa ((aicr, gi),..., {ata, qt)){bofi, • • •, bofm) 

{a G Ym, {iaia,qi),..., {ata,qt)) G Am,m = 0,1,...). 

Obviously, 25 is an R-relabeling which relabels trees in the following way: if cr G S 
[resp. X G X] is a label at a node d of a tree p G F^{X), then IB relabels d by a 
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sequence of productions ((aiu, qi ),..., {atcr, qt)) [resp. ((oix, gi),..., {atx, g*))] from P 
with length at most k. 

Next define an R-transducer C = (A, X, C, D, Y, P”, cq) with 

C = {(u; oi,..., ot) I 1 <u <t <k,ai ^ A (i = 1 ,..., t)} 
and Co = (l;oo). Moreover, P" is defined as follows: 

(i) For each (u; ai,..., aj) € C and {{aip,qi),... ,{atp,qt)) € Aq U X, 

{u;ai,... ,at)i{aip,qi),{atp,qt)) qu is in P". 

(ii) Let (tt; oi,..., a^) G C and ((aiu, gi),..., ( 0417 , g*)) G A^, (m > 0). Write 

{aia,qi) in the more detailed form ajU gi(a^lC^'^ ■ ■ {^j € A^ipj = 

1,... ,m,nii + .. . + nim = ni,qi G Fn{Y U = 1,... ,t). Then the production 

(u; ai,..., at){{aia, gi),..., {ata, qt)) 

(((^ 11 ) bl), . . . , (uiriui )•••) {{Uml i bm)) ■ ■ ■ j iPmnum i bm))'Cm”"') 

in is P', provided that riij + ... + ntj < k {j = 1,... ,m), where up = riij + ... + 
Uu-ij + I, hj = (aij,..., Sitj) and j = 

Obviously, Gi is a deterministic R-transducer. Furthermore, one can easily see the 
following connection between derivations in 21 and Gi: 

Let p G Ps(X) and g G Fn{Y) be arbitrary trees, and take a A;-copying derivation 

a : aop g. 

Consider the tree p with (p, p) G r® which is the result of relabeling each node d of p by 
the production-sequence of a at d. Then in Gi we have a derivation 

(3 : (l;ao)p g 

such that if a = (ai,..., an) (n < k) is the state-sequence of a at d then ((1; a),... , (n; a)) 
is the state-sequence of f3 at d. Conversely, if for a p' G Pa(X) and g' G Fq{Y) there is 
a derivation 

P' ■■ {Y,ao)p' q', 

then for the (uniquely determined) tree p' G Fs(X) with {p',p') G Tsg we have the 
derivation 

f / . :)t / 

a : aop ^21 4 • 

Moreover, the state-sequence of j3' at a node d of p' is of the form ((1; a'),..., (m; a')) 
(a' = .. ,a)n)) with m < k, and a' is the state-sequence of a' at d. Therefore, Gi 

is /c-copying and S = Rr® o re holds. Since K is closed under relabelings, this implies 
S^VPikiK). □ 

From Theorem 14.7.91 by Theorem 14.7.81 we get 
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Corollary 4.7.10 Let K he a class of forests elosed under relabeling and regular inser¬ 
tion. Take a GSDT 21 = (S, X, A, Y, P, A'), aT ^ K and a positive integer k. Then the 
language 

L = {w \ there is a k-eopying derivation ap =^* w for some a G A' and p gT} 

is in 'DQk{K). □ 

Three more language operations will be needed. 

Definition 4.7.11 Let X be an alphabet and ff ^ X a symbol. For each L C X*, 
res{L,jf) (regular substitution) denotes the language defined as follows: 

(i) if L = {e}, then res(L, ff) = , 

(ii) if L = {x} (x G X), then res(L, ff) = #*x#*, 

(iii) if L = {ux} {u G X*,x G X), then res(L, = res{u,f^)res{x,if), 

(iv) if L is arbitrary, then res(L, = |J(res(t(;, \ w G L). 

Theorem 4.7.12 Let K be a class of forests closed under regular insertion. For each 
R G K there exist a linear nondeleting GSDT 21 and a forest S G K such that 
res(yd(i?),#) = St^. 

Proof. Let R C Fj^(X), R G K, and denote yd(i?) by L. Let A = Ai = {(i|dGSUX} 
and let / be the regular insertion defined by f{d) = {d(Ci)}*^^ (d G S U X). Define the 
GSDT 2t = (D, X, {oo}, X U {#}, P, oq) with D = SU A (Di = Si U A, m / 1) 

so that 


P = {aox #aoCi,aox -G- ao6# | x G X} U {oocr ^ #ao6 | cr G So}U 

{oox —>■ X I X G X} U {a^a -G aoCi • • • ooCm | c G T,m, rn > 0}. 

Obviously, 21 is a linear nondeleting GSDT satisfying res(L, = /(ii)r 2 [. Moreover, by 
our assumptions, f{R) = S G K. □ 

Theorem 4.7.13 Let Y be an alphabet and if ^Y a symbol. Take a language L PY* 
and a class K of forests closed under relabeling and regular insertion, /f res(L,#) G 
Vg{K), then L G POfiK). 

Proof. Let res(L,i^) = Tr^ where 21 = {Y, X, A,Y U {if},P,ao) is a deterministic 
GSDT and T C Fs(X) is a forest from K. Moreover, let A = {ai, ... ,ak}. A 
word (G res(L,#),yi,,... G Y) is called proper if 

ni, n 2 ,..., Ur-i are pairwise distinct. 

Consider a derivation 

a : aop wibipiW2b2PiW3 ■ ■ ■ WsbsPiWs+i =^* W1V1W2V2W3 ... WgVsWs+i = w, 
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where p G T, pi is a subtree ofp, ( 6 i, 62 , ■ ■ ■ > &s) is the state-sequence of a at pi, bipi =^* Vi 
(i = 1,..., s) and rci,..., ui, ■ ■ ■ ,Vs G (yu{#})*. If tc is proper and bi = bj (i ^ j), 
then in Vi (and thus in Vj) at most one symbol from Y may occur. 

Now for each a G {tu > 0) take all pairs {a, M), where M is a matrix of type kxm 
whose elements are from Y U AEm U {e}. Moreover, let be a ranked alphabet with 
Oq = So and Dm = {(<7) M) \ a G S^} {m > 0). 

Let Y = {ui ,..., yi} and denote by Rj {i = 1,..., /c, j = 1,..., 1) the set of all trees 
P G Dj]{X) for which v G where v is the word obtained from the derivation 

aip V. Moreover, let Tu^i {i = 1,. ■ ■, k) he the forest of all trees p G Fj]{X) satisfying 
u G #*, where v is obtained again by the derivation aip v. 

By Theorems 14. 5. 4l and l3.,'I.2l and Corollarv l4.8.171 the Rj {i = 1,... ,k,j = !,...,/ + !) 
are recognizable forests. Therefore, there are SX-recognizers Aij = [Aij, aij, A[-) {i = 
1,..., A:, j = 1,..., / -b 1) with Aij = {Aij, S) such that T(Ajj) = Rj. Consider the 
DF-relabeling 53 = (E, X, B,D, X, P', B) where 

B = {(pdii,... ,paii+i,... ,paki, ■ ■ ■ ,paki+i) \ P G Te(X)}, 
and P' is given as follows: 

(i) For each x € X, the production 

X y {xCXll, • • • : ^ ■ • • ; • • • ; 


is in P'. 

(ii) For every u G Sq, the production 

a ) • • • ) ) ■ ■ ■ ) ) ■ ■ ■ ) )(T 


is in P'. 


(iii) For each a G S^ (m > 0) the productions 


a(bi,... ,bm) ^ b(cr,M)(^i,.. .,^rn) 

are in P', where b* = ..., ..., ..., b = 

{bii,...,bii+i,...,bki,...,bki+i) G B {t = l,...,m), bij = {b[f,..., b^^^^) 
{i = 1,... ,k,j = !,...,/ + !) and the element mu {i = 1,... ,k,t = 1,...,m) of 
matrix M is given by 


e 


mu = 


Vu 

(ki^t 


if I’M (= Km, 

if C* e (1 s “ £ 0, 

otherwise. 


Obviously, mu is well-defined since there are no two components and 

{I < i < k,l < ji,j 2 < ^ + 1,41 7 ^ 42 ) such that bf^^ G A'--^ and bf^^ G A[-^ both 
hold. 
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By the definition of 55, it relabels trees in the following way: take a tree p € Fy.{X), 
and let ... ,Pm) {"m > 0) be the subtree of p at a node d. The provides us with the 
information about which of the subtrees pi, ■ ■ ■ ,Pm is translated by 2t(aj) (i = 1 ,..., /c) 
into a word from {Y U {#})* with 

(I) no occurrence of letters from Y, 

(II) exactly one occurrence of letters from T, 

(Ilia) at least two occurrences of letters from T, or 

(Illb) the given subtree is not in dom(T 2 i(a.)). 

Next take the GSDT Gi = {Q,X,A,Y,P",ao) where P” is given as follows: 

(a) If ap ^ w [a £ A,p G X U € (T U {#})*) is in P, then the production 

obtained from ap ^ w hy replacing all occurrences of ^ in re by e will be in P". 

(b) Let aa ^ w [a G A, a G Jlm,rn > 0,r(; € (T U {#} U Ar.m)*) be in P. Then 
all productions a{a,M) w' are in P” where w' is the result of replacing all 
occurrences of in w by rriij {f < i < k,l < j < m) and all occurrences of # 
by e. 

It is clear that C is deterministic. Moreover, one can show by induction on hg(p) for 
arbitrary a G A,p G Ft.{X) and w G (Y U{^})* the implication 

ap^%w aT<s{p) ip{w) 

holds, where (p : iY VJ {#})* —>■ Y* is the homomorphism given by p{y) = y {y & Y) and 
(p{#) = e. Thus 


L = {w' gY* \ aoT^{p) w', aop (1) 

p GT,w G {Y U {#})* and w is proper if |r(;'| > 2}. 

Furthermore, by our remark concerning state-sequences of derivations yielding proper 
words and the construction of C, the elements of a state-sequence of a derivation 
<^oT<s{p) w' from ([1]) are different at any node of r®(p). Therefore, since € has 
k elements, each element of L can be obtained by a fe-copying derivation in (t. Finally, 
since by our assumptions Tr® G K, using Corollary 14.7.101 we get L G PQk{K). □ 

Definition 4.7.14 Let X be an alphabet and # ^ X a symbol. For each language 
L C X*, the language c^{L, is defined by 

c*{L, #) = {{w#)"- \ w gL, n = 1, 2,...}. 

Theorem 4.7.15 Let K be a class of forests closed under regular insertion. For each 
R G K there exist a DGSDT 21 and a forest S G K such that c*(yd(i?), #) = St^. 
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Proof. Suppose R C Fj]{X) and let L = yd{R). We introduce the ranked alpha¬ 
bet A = Ai = {d I d G S UX} and define a regular insertion / by /(d) = {d(^i)}*^i 
(d G SUX). Moreover, let D be the ranked alphabet for which fli = Si U A and 
Dm = '^m {m > 0, m 7 ^ 1). Consider the GSDT 


21= (0,X,{ai,a2 },XU{#},P,ai) 


where 


P = {aid —)■ 01 ^ 102 ^ 1 # I d G S U X} 

U {a 2 d —)■ 02.^1 I d G S U X} 

U {aix ^ e I x G X} U {aid —>■ e | ci G Em, m >0} 

U {a 2 X ^ x I X G X} U {a 2 d — 02^1 ... 02 /™ | a G Em, m > 0}. 

It is obvious that 21 is a deterministic GSDT satisfying c^{L, jf) = S'ra, where S = f{R). 
Moreover, by our assumptions S a K. □ 

Theorem 4.7.16 Let U C c*(-L, 7 /) {L C Z*, # ^ Z) be a language containing infinitely 
many words for each w G L. Furthermore, let K be a class of forests elosed under 

relabeling and regular insertion. IfU& VQf{TZ{K)^, then L G VQ{K). 

Proof. Let 21 = {E,X,A,D,Y,P, A') be an R-transducer and = {D,Y,B,Z U 
{/t}, P', 60 ) a /c-copying deterministic GSDT. Moreover, take a forest R C Ps(X) from 
K satisfying U = (Pr 2 i)riB. Since K is closed under regular insertion, we may, without 
any loss of generality, assume that A' is a singleton, say A' = {ao}. First we shall 
construct an R-transducer 21 = {E, X, A,D,Y, P,aQ) which translates every p G Ps(X) 
into a tree q G Fq{Y) in the same way as 21 provided that q G dom(rig). In addition, if 
during the translation of p into q by 21 , an occurrence of a subtree p' in p is translated 
starting in a state a into a tree q', then during the corresponding translation of p by 
21 , p' will be translated starting in a state consisting of a and the state-sequence of the 
derivation of g in 2S at the subtree q'. Thus, 21 will have the property that if during 
the above translation of p by 21 , two copies of an occurrence of p' are translated starting 
in states ai and 02 , respectively, into the trees qi and q 2 such that ai = 02 , then the 
state-sequences of the derivation of (7 in 23 at qi and q 2 coincide. 

Let T<s{q) = (tc#)™ {w G Z*). If m is large enough, then the properties of 21 will 
make it possible to replace in a derivation aop q different derivations of p' starting 
from the same state by one of them such that for the resulting output tree q we shall 
have r(g(g) = with m' > m. By prescribing the applications of productions of 

21 in this manner we shall arrive at a DR-transducer 2li such that contains 

infinitely many words for each w £ L and S is obtained from P by a relabeling. 

Afterwards applying a deterministic gsm to we shall get L. 

Thus construct the R-transducer 21 = {E, X, A,D,Y, P,ao) where 

A = {(a, b) I a G A, b G P", n = 0,1,... ,k} 
and ao = (ao, ( 60 )) • Moreover, P is given in the following way: 
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(i) Let ap ^ q (a G A, p G X U Sq, q G Fq{Y)) be in P and take a vector b G 
(0 < n < /c). Then the production (a, h)p —)■ g is in P. 

(ii) Let aa ..., [a G A, a G m > 0, a* G i = 

ni + ... + Um = n, q G Fn{Y U H„)) be in P and b = {bi,... ,bs) G . Moreover, 

for every tt (1 < u < s), and every j (1 < j < n) take the derivation 

buQ Wujlbujl^jWuj2 ■ ■ ■ '^ujujbujuj^j‘^uju^ + 1 
{wuji ) • • •) G (Z U U Bij^n ~ {Cj})) ) buji , • • •, buj^^ G i?). 

Set bj = ( 6 ij^,..., , • • •, , • • •, bsjs .) 0 = 1 > • • • > ^) • Then the production 

(a,b)(T ^ g(((aii,bi),..., (aini,bnj)^”i, ((a 2 i,bnj+i),... 

• • • ) (02n2 ) bjj]^+n 2 ))? 2 ^ ’ • • • > ((®mi) ^ni+...+nm-i+l) 7 ■ ■ ■ 7 iflmrim 7 ^n))Crtr ') 

is in P, provided that for each j = 1,... ,n the length of the sequence hj is not 
greater than k. 

From the construction of 21, one can easily see the following connection between 21 and 
21. Take a tree p G Fj]{X), a node d of p and let p' be the subtree of p at d. Moreover, 
write p = r{p') (r G Fy,{X U Hi)), and consider a derivation 

a ; aor(p') q{ap'^) q{t) = q 
{q G Fn{Y), aor ^(a^f), q G k{Y U H^, ap'^ t, t G Fn{Yr) 

with q G dom(T®). Then in 21 we have a derivation 

/3 : {ao,{bo))r{p') g(((ai, bi),..., (a^, b„))p'”) q{t) = q, 

where bj (1 < i < n) is the state-sequence of the derivation 

7 : boq^^w {g{ZU {#})*) 

at the subtree ti. Therefore, if (ai,bj) = (opbj) (1 < i,j < n), then the state-sequences 
of 7 at the subtrees ti and tj coincide. We can assume that 21 itself has this property, 
because the equality o r® = o obviously holds. 

Consider a word G with m > 2k + 1. More exactly, let p G i? be a 

tree for which under the derivation a^p q { G Fq(Y)) the equality T®(g) = 
holds. Let r G Fy,{X U Hi) and p' G Fy.{X) with r{p') = p. Moreover, write the above 
derivation in the form 

a' : aor(p') q{ap'^) q{t) = q 
{q G Fn{Y), aor g(a^f), q G Pn{Y U H„), ap'- t, t G Fn{Yr). 

Assume that a state a G A occurs more than once in a, and let 07 ,..., Ui. {1 < ii < 
... < ij < n) he all occurrences of a in a. Then the state-sequences of 

/?': boq^liw#r (g(^U {#})*) 
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at the subtrees ,..., T. coincide. Let {bi,..., bg) be this common state-sequence. 

Among Li,... ,ti- let tq be the tree for which has a maximal 

number of occurrences of Replace the considered occurrences of tq,... ,ti. in q by 
tq, and denote by q' the resulting tree. We claim that for q' we have Ts^iq') = 
with m' > m. To prove it let us distinguish the following two cases: 

(I) There exists an r (1 < r < s) such that ^ occurs at least twice in the word 
Then our claim obviously holds. 

(II) ^ occurs at most once in each word Take a fixed r 

(1 < r < j), and write f3' in the form 

boq wibi%W2 ... Wsbs%Ws+i 
W 1 V 1 W 2 ■ ■ ■ WsVgWs+l = ('(i’#)™- 

Since m> 2k+ 1 and s < k, there exists atCu(l<u<s-|-l) such that # occurs 
at least twice in Wu- This also implies our claim. 

Thus we have got the following result. If we replace in a' every subderivation 
ttrp' tr (ur = a,r = R,... ,ij) by ap' tq, then b^q' with m' > m 

holds for the resulting output tree q'. Therefore, prescribing the applications of the 
productions of 2t in this way, we arrive at a deterministic R-transformation whose com¬ 
position by Ttg, applied to a suitable forest from K, for each w € L yields infinitely many 
words {m > 1), and only such words. Next we show how this can be carried out. 

First we define a deterministic R-transducer 2ti. 

Let A = {ai...,as}, and define a set X of variables by 

X = {{x, (ci,... ,Cs)) \ X £ X, a = {aiX,qi) £ P or a = *, i = ,s} 

where * is a new symbol. Moreover, define the ranked alphabet A, where for each 
m (> 0), 

Am = {(fJ, (ci,... , 0 ^)) I a £ Sm, Ci = {aia,qi) £ P or a = *, f = 1,... ,s}. 


Now take the R-transducer 2ti = (A, X, A, II, Y, Pi, oq) for which Pi is given as follows: 


(a) For each ai £ A and (x, (ci. 

..., C5)) £ X, if Ci = (ttiX, qi), then the production 


ai{x, (ci,... ,Cs)) qi 

is in Pi. 


(P) For each ai £ A and (a, (ci. 

..., Cs)) £ Am, if Ci = (ajCJ, qi), then the production 


aj(cj, (ci,.. .,Cs)) qi 


is in Pi. 
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Obviously, 2li is a deterministic R-transducer. 

Next, let D = (L, X, {do}, A, X, P", do) be the F-relabeling where 

P" = [x ^ do{x, (ci,... ,Cs)) I X € X, (x, (ci,... ,Cs)) € X}U 
{a{do, ...,do)^ do(a, (ci,... ,Cs))(6 1 ■ ■ ■ 1 Cm) I ^ 

(a, (ci,... ,Cs)) G Am, m > 0}. 

Put S = Rr^. Since K is closed under relabeling, S G K. Moreover, taking into 
consideration the remarks preceding the construction of 2ti, one can easily see that, for 
each w G L, (5'Taj)r<s contains infinitely many words of the form (tc#)'" (m > 1), and 
only such words. 

Finally, take the deterministic gsm C = (Z U {#}, {cq, ci}, Z, co,Pc, {ci}) where 

Pc = {coz —>■ zco \ z G Z}[J {co# ^ eci} U {ciz —)• eci \zGZU {#}}. 

Obviously, (rc#)™'Tc = w for all w G Z* and m > 1. 

Denote by *Bi the deterministic A:-copying R-transducer obtained from by Theorems 
14.5.41 and 14.7.81 Moreover, let Gii be the DR^-transducer given to C by Theorem 14.7.41 
Then the equality L = yd(S'r 2 ti o o holds. Thus, by a repeated application 
of Theorem 14.4.61 (hi) and Corollary 14.4.81 (ii) and using Theorem 14.6.151 and Corollary 
14.3.171 we get for a suitable deterministic R-transformation r and a suitable T G K the 
equality Tr = St%^ oTe;^. (Observe that the F-transducer 21 given in Lemma fd. 1.111 
is an F-relabeling. Hence, closure under relabeling implies closure under intersection 
with regular forests.) Finally, again by Theorem 14.5.41 we have L G PQ{T). □ 

Definition 4.7.17 Let X be an alphabet and # ^ X a symbol. Then for L C X* the 
language C 2 {L, #) is defined by C 2 (L, #) = {w^w \ w G L}. 

Theorem 4.7.18 Let K be a class of forests closed under relabeling and regular inser¬ 
tion. If R G K, then there exist a 2-copying GSDH-transducer 21 and a forest T G K 
such that C 2 {yd{R),ff) = Tt%. 

Proof. Suppose R C F^{X) and let L = yd(i?). Moreover, take the ranked alphabet 
A = Ai = {d|dGSU A}, and consider the regular insertion defined by /(d) = 
{d(^i)}*^i (d € S U X), and set S = f{R). Then S G K. Finally, let D = S U A be the 
ranked alphabet with Hi = Si U A and Llm = {m >0, m 1). 

Now consider the R-relabeling 25 = {Ll, X, {bo,bi},Ll, X, P,bo), where 

P = [bod -G d{biCi) I d € S U A}U 
{bia (7{biCi,..., biCm) I cr G S^, m > 0}U 

{6ix —>■ X I X G X}. 

Obviously, T = Sng consists of all trees of the form d(r), where r G R and 
d = root(r). Since 25 is a relabeling, T G K. Now we construct the required GSDT 
21 = (0, A, {ao}, A U {#}, P', ao), where 

P' = {aod —7> ao/i#aoCi | d G S U A}U 
{aoiT ^ aoCi ■ ■ ■ aoCm \ <7 G S^, m > 0} U {aox —>■ x | x G A}. 

It is clear that 21 is a 2-copying GSDH-transducer and that C 2 {L, ff) = Tra holds. □ 


187 




















4 TREE TRANSDUCERS AND TREE TRANSFORMATIONS 


Theorem 4.7.19 Let Y be an alphabet and ^ a symbol. Take a language L C Y* 
and a class K afforests closed under relabeling and regular insertion. Ifc 2 {L, ff) G Q{K), 
then L G DQ{K). 

Proof. The idea behind the proof is similar to that of Theorem 14.7.161 but this is much 
simpler. 

Let 21 = (S,X, yl, y U {ff}, P, A') be a GSDT and R ^ K a SX-forest such that 
Rt^i = C 2 {L, ff). Since K is closed under regular insertion, we may assume that A' is a 
singleton, say A' = {uo}. 

Take a tree p G 4?, a subtree p' of p and let p = r{p') (r G .Fs(2f U Hi)). Consider a 
derivation 

a : aor(p') =^* wiaip'w 2 ... Wkakp'wk+i =>* wiViW 2 ... WkVkWk+i = wffw, 

where aor(^i) =>* ... Wkakf.iWk+i, wi,..., Wk+i,vi, ...,Vk G (T U {#})* and 

Oip' =>* Vi (i = 1,... ,k). Then (ai,..., a^) is the state-sequence of a at p'. Assume that 
a state a G A occurs at least twice in (oi,..., Ok), and let aq and Ojj (1 < A < ^2 < k) 
be two such occurrences of a. Then, taking the relevant occurrences of Vi^ and Ujj in 
wffw, we have the decomposition wffw = On the other hand the words 

uiVijU 2 VijU 3 (j = 1,2) are also in Rt^i. Hence, uq = Vi^ must hold. This implies that 
if we replace for each t {1 < t < k) such that at = a, atp' =^* vt by atp' =>* uq, we get 
the same word wffw. Therefore, prescribing accordingly the applications of productions 
from P, we arrive at a deterministic GSDT yielding C 2 (L, ff). This can be carried out in 
the same way as in the proof of Theorem 14. 7. 161 but here the resulting 2ti is a DGSDT. 
Thus, taking the F-relabeling S) defined in the proof of Theorem 14.7.161 for S = 
we have S € K and S’rsti = C 2 {L,ff). Moreover, by Theorem 14.5.41 there exists a DR- 
transducer with C 2 {L,ff) = yd(S'rtBj. Finally, consider the deterministic gsm C of 
the proof of Theorem 14. 7. 161 with Y instead of Z, and let Ci be the corresponding DRr- 
transducer. Then the equality L = yd(5T®^ o Te;^) holds. Thus, by Theorem 14.4.61 (hi), 
Gorollarv l4.4.8l fiiL Theorem l4.6.15l and Gorollarv l4.3.171 for suitable DR-transformation 
r and a T G iF, we get Tr = oThis, by Theorem 14.5.41 implies L G VQ{T). □ 

4.8 THE HIERARCHIES OF TREE TRANSFORMATIONS, 
SURFACE FORESTS AND TRANSFORMATIONAL 
LANGUAGES 

In this section we prove that the compositions of n F-transformations or n R- 
transformations form proper hierarchies when n = 0,1,2,.... Similar results will be 
shown for the classes of forests (n-surface forests) which can be obtained from regular 
forests by compositions of n F- or n R-transformations. All these results will follow from 
the fact that the classes of languages (re-transformational languages) obtained by taking 
the yields of re-surface forests form a proper hierarchy. 

Definition 4.8.1 A forest T is an (n,R)-surface forest if T G Surf(7F"'). (re, F)- and 
(re, RR)-srer/ace forests are defined in a similar way. 
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Definition 4.8.2 A (string) language L is an {n,Ii)-transformational language if 
L = yd(T) for some (n, R)-surface forest T. (n,F)- and {n,Rji)-transformational lan¬ 
guages are defined similarly. 

If n = 1 then we shall speak about R-, F- and Rn-transformational languages, as well. 

The following results show that in studying (n, R)-surface forests and (n, R)- 
transformational languages we can use RR-transformations, too. 

Theorem 4.8.3 For each natural number n, the equality Surf (7^”) = Surf(7^^) holds. 
Proof. This follows from Theorems 14.4.61 fil and 14.3.1^ and Lemma 14.6.51 □ 

From Theorem 14.8.31 we directly get 

Corollary 4.8.4 For every natural number n, the class of {n,R)-transformational lan¬ 
guages coincides with the class of {n,K^)-transformational languages. □ 

Using Theorems 14.4.71 lil and 14.2.71 from Theorem 14.8.31 we obtain 

Corollary 4.8.5 For every natural number n, Surf(7^”) is closed under LF- 
transformations and LR-transformations. □ 

Now we can state and prove a result giving a recursive procedure by which the hierarchy 
theorems can be proved easily. The procedure will be based on the “bridge theorems” of 
the previous section which concern the operations res, C 2 and c*. These associate with 
each language which is not in a given class another language which is not in another, 
larger class. 

Theorem 4.8.6 Let K he a class afforests closed under relabeling and regular insertion. 
If ydVTZf{K) C yd7^(iF), then for each integer n > 1, 

yd7^”(A:) C ydP7^/(7^”(A:)) C ydVn{lZ^{K)) C ydlZ^+^K). 

Proof. By Theorem 14.3.151 and Lemma 14.7.31 77” (7L) is closed under relabeling and 
regular insertion, for every n > 1. In the sequel these facts will be used without further 
mention. 

We shall proceed by induction on n. Let n = 1. Take a forest R such that R G 77(7L) 
and yd(72) 0 ydVTZf{K). Then by Theorems 14. 7. 12114. 5. 41 and 14. 2. 81 there exist an LNF- 
transformation r and a forest S € 77(7L) such that res(yd(7?), = yd(5r). Moreover, 

by Theorem 14.3.151 St G TZ{K). On the other hand, since yd(72) ^ ydVIZf{K), by 
Theorems 14.7.131 and 14.5.41 res(yd(72), #) ^ ydT>IZ{K). Thus, the proper inclusion 
ydVn{K) c yd77(A:) holds. 

Next take an 72 G 77(77) with yd(72) ^ yd7777(77). Then, by Theorems 14.7.181 and 
14.7.81 there exist a 2-copying homomorphism r and a forest S G 77(77) such that 
C 2 (yd(72), #) = yd(S'r). On the other hand, since yd(72) 0 ydP77(77), by Theorems l4.5.4l 
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and l4.7.191 C2(yd(i2), #) ^ ydTZ{K). Therefore, the inclusion yd77.(il') C yd'DTZf{TZ{K)) 
is valid. 

Again take an 4? € R-iK) with yd(i?) 0 yd'DTZ{K). By Theorems l4.7.15l and l4.5.4l there 
exist a DR-transformation r and a forest S € R-iK) such that, c*(yd(R), #) = yd(iS'r). 
Moreover, since yd(i?) 0 yd'DTZ{K), by Theorems 14.7.161 and 14.7.81 c*(yd(R),#) 0 
ydVTZf{TZ{K)). Thus we have got that 

ydVnf{n{K)) c ydT>7^(7^(A:)). 

Finally, take an 4? € TZ^{K) with yd(4?) ^ ydPT^j(7^(4F)). Then again by Theorems 
I4.7.12l and l4.5.4[ there exist an LNF-transformation r and a forest S € R?{K) such that 
res(yd(44), = yd(5r). Moreover, by Theorem 14.3.151 St G R?{K). On the other 

hand, since yd(4?) ^ ydVTZf(TZ{K)) ^ by Theorems 14.7.131 and 14.7.81 res(yd(4?), #) 0 
ydP7^(7^(4^)). Therefore, ydP4^(7^(4^)) CydR?{K). 

Summarizing our results, we have 

ydn{K) c yd4:>7^/(7^(A:)) c ydP7^(7^(4^)) c ydR?{K) 
which completes the proof for n = 1. 

The transition from n to n + 1 is illustrated by Fig. 14.41 □ 



Figure 4.4. 

According to Theorem 14.8.61 to show that the classes of (n,R)-transformational lan¬ 
guages form a proper hierarchy it is enough to prove the properness of the inclusion 
ydP7^/-(Rec) C yd77.(Rec). For this we need 

Lemma 4.8.7 For each k-copying DGSDT 21 = (S, A, A, T, P, oq) there exists a lin¬ 
ear DGSDT 23 = {E, X, B,Y, P',b(}) such that Par(rr®) = Par(TT 2 i), for every forest 
TCFj^iX). 

Proof. For each w (z {Y U AH)*, let w denote the word obtained from w by erasing all 
a^’s (a € A, ^ € H). 

Let B = {(oi,... ,a„) \ n < k, G A{i = 1,... ,n)} and bo = (oq). Moreover, P' is 
defined in the following way: 
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(i) Let a = (oi,...,a„) G B and x G X be arbitrary. Assume that the produc¬ 
tions UiX —>■ Vi (oj ^ A, Vi ^ Y*, i = 1,... ,n) are in P. Then the production 
ax —>■ ui ... is in P'. 

(ii) Take an arbitrary a = (ai,..., Un) G B and a G (m > 0). Suppose P contains, 
for each i = 1 ,..., n, a production 

OjCT —)■ Wij^aij^^jWij2 ■ ■ ■ Wij-,aij^,^jWij^,_^_^^ = Wi 

(wij^ ,..., G {Y U , Uij^ ) ■ ■ ■) G A, 1 < j < m). 

Then the production 

acr >■ ) • • • ) 1 ■ ■ ■ 1 ®nli) ■ ■ ■ ) )?1 • • • 

...(ai 

mi) • • • ) Olmi^ ) • • ■ ) Q-nmi j ■ ■ • ; 0,nmn^ )Cm'Wl ■ ■ ■ '^n 

is in P', provided that Ij < k (j = 1,..., m). 

Obviously, 53 is a linear DGSDT. Moreover, the derivations in 21 and in IB are related 
as follows. Take a vector a G A” (n < k) and a tree p G Fy,{X). Consider the derivations 
a : ap^ where w = wi... Wn & Y* and ai : Uip Wi (i = 1,..., n). By the 

state-sequence of a at a node d of p we mean (ai,..., a„), where a* (1 < i < n) is the 
state-sequence of ai at d. Furthermore, we say that a is /c-copying if the length of the 
state-sequence of a at any node of p is at most k. Assume that a is /c-copying. Then for 
some w' G Y*, (3: ap"^ exists. One can easily show by induction on hg(p) that 

the state-sequence of (3 at any node d of p is of length one (if it exists) and coincides, as 
a sequence of states of 21, with the state-sequence of a at d. Finally, rc is a permutation 
of w'. Therefore, the equality Par(TT 2 t) = Par(Tr(g) holds. □ 

From Lemma 14.8.71 by Theorems 1 1.6.1 7l and and Corollary 14.6.61 we get 

Corollary 4.8.8 Let T C Fs(X) be a recognizable forest and 21 = (S, A, A, T, P, ao) a 
finite-copying DCSDT. Then Par(TT 2 t) is semilinear. □ 

We now can state and prove that the hierarchy of (n,R)-transformational languages is 
infinite. 

Theorem 4.8.9 For every natural number n, the inclusions 

ydP”(Rec) C ydPP/(P"(Rec)) C ydPP(P”(Rec)) C ydP”+^(Rec) 


hold. 

Proof. By Lemma 14.7.21 and Corollary 14.6.61 Rec is closed under regular insertion and 
relabeling. Thus, by Theorems 14.8.6114.5.41 and 14.7.^ and Corollary 14.8.81 it is enough 
to show that there exist a regular forest T C Ps(X) and a CSDT 21 = (S, X, A, Y, P, oq) 
such that Par(TT 2 i) is not semilinear. For this let S = Si = {a}, A = {oo}, X = {x}. 
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Y = {y} and P = {aoa —>■ clqx —>■ y}- Moreover, let T = {a{x)}*^. Then 

Tt^ = I n = 0,1,...}. Thus, Par(Tr 2 i) = {(2”) | n = 0,1,...}, which is obviously 
not semilinear. □ 

From Theorem 14.8.91 we directly get 
Corollary 4.8.10 For every natural number n the inclusions 

(i) yd7^^(Rec) C yd7^^+l(Rec), 

(ii) 7e’^(Rec) C 7e’^+i(Rec), 

(iii) C 

hold. □ 

Finally, we give two more hierarchies of transformational languages, surface forests 
and tree transformations. 

Theorem 4.8.11 For every natural number n the inclusions 

yd7^”(Rec) C ydJ^^\Rec) C yd7e"+^(Rec) 

are valid. 

Proof. By Theorems 14.3.31 and 14.3.1^ and Corollary 14.6.61 the inclusions yd7^”(Rec) C 
yd J^’''^(Rec) C yd7^”''''^(Rec) hold. By the proofs of Theorems 14.8.61 and 14.8.91 
yd7^"^(Rec) is a proper subclass of yd'R(7^"(Rec)). Moreover, by Theorems 14.3.31 
and 14.3.121 and Corollary 14.6.61 the equality 4^(7^"(Rec)) = T'”’''^(Rec) holds. 

Thus, the inclusion yd7^”(Rec) C ydJ-''^'*“^(Rec) is valid. Finally, by Theorem 
14.8.91 yd?^(7^”(Rec)) C ydP7^(7^”(Rec)) C yd7^”’''^(Rec). Therefore, the inclusion 

ydT'”+^(Rec) C yd7^"+^(Rec) is also valid. □ 

From Theorem l4.8.11l using Theorems 14.3.31 and 14.3.1^ and Corollarv l4.6.6l we get the 
following results. 

Corollary 4.8.12 For every natural number n the inclusions 

n^{Rec) C r"+^(Rec) C 7^”+^(Rec) 

hold. □ 

Corollary 4.8.13 For every natural number n the inclusions 

(i) ydJ^(Rec) C ydJ^”+^(Rec), 

(ii) T'"(Rec) C J^+i(Rec), 

(iii) c 

are valid. □ 


192 
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4.9 THE EQUIVALENCE OF TREE TRANSDUCERS 

Since the equivalence problem for (nondeterministic) generalized sequential machines 
is undecidable, there exists no algorithm to decide for arbitrary two tree transducers 
whether or not they are equivalent. In this section we show that there is an algorithm 
for deciding the equivalence of two tree transducers when at least one of them induces 
a partial mapping. Moreover, we shall prove that it is decidable whether the tree trans¬ 
formation induced by a given tree transducer is a partial mapping when restricted to a 
given recognizable forest. 

We start by introducing a concept. 

Definition 4.9.1 Let p G Fj]{X). A tree p' G Fj]{X U H”') is called a supertree of p if 
there are trees pi, ■ ■ ■ ,Pn G Fj:;{X) such that p = p'{pi, ■ ■ ■ ,Pn)- 

To prove the decidability results we shall give five reduction rules formulated in the 
following hve lemmas. In these lemmas 21 = {T,, X, P, A') will be a fixed R- 

transducer and B = will be a fixed SA-recognizer with B = {B,T,) and 

T(B) = T. Furthermore, set Q = {p G T | \pT(^\ > 2}, i.e., Q consists of all trees from T 
which are translated into at least two different output trees by 21 . 

Lemma 4.9.2 Let pi,p 2 G Fj^iX U Hi), pa G Fs(A), ni,n[,n 2 ,n 2 > 0, 

qiePniYUEn,), q[ G Fn{Y U E^,), qa G F^^{Y U En,), G 
qa G FciiXY'^, q'g G Fq{Y)'^ 2 , ao,aQ G A' and Uj G A”*, a' G AP (i = 1,2). Moreover, 
set Ai = {oj. I j = 1 ,... ,ni} and A'- = {a'. | j = 1 ,... ,n'A {i = 1 , 2 ). Assume that the 
following conditions are satisfied: 

(i) pi(p2(P3)) € T, 

(ii) aopi qiiaiCP), a'^pi q'fia'Ai'^), 

(hi) aip^i q2(a2Cr); ^iP2^ 

(iv) asPa" qa, a'aPa" qlj, 

(v) Psfi = P2iP3)fi, Ai C A 2 , A[ C A' 2 , 

(vi) for all r G Fb(y)"'i and r' G Fq{Y)A^ 9i(r) 

Then pi(pa) G Q- 

Proof. First let us note that the conditions of Lemma 14.9.21 imply pi(p 2 (P 3 )) G Q. 

Next take two mappings /: {1,..., ni} ^ {1,..., n 2 } and g ■ {!) ■ ■ ■) m ^ {1 ,... ju'a} 
such that ui. = 02 .,..., (i = 1,... ,ni) and afi = 02 (z = 1,...,n)). By (v), there are 

n' 

such mappings / and g. Thus, by (iv), we have aiPa^ r and a'^pg^ r' with r = 
(^ 3 i(i),---,g 3 i(„^)) and r' = ,..., p. This, by (ii) implies aoPi(p 3 ) gi(r) 

and aQpi(p 3 ) =>* Qi{P)- By (vi), (i'i(r) p Moreover by (v), pi(pa) G T. Therefore, 

PiiPs) € Q- ° 
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Lemma 4.9.3 Let pi € Fj:{X U Hi), p 2 G Fj^{X), n,n' > 0, qi £ Fn{Y U H„), 
q[ € FniY U En>), qa G FniY)^, q'2 G F^iY)^', ao,a^ G A', a G A^ and a' G A^'. 
Furthermore, let K he the maximum of the heights of the right-hand sides of the produc¬ 
tions from P. Assume that the following conditions are satisfied: 

(i) PiiP2) G T, 

(ii) aopi gi(aC”), a'^pi gi(a'^”'), 

(iii) ap^ qa, a'p^' q'a, 

(iv) pathi(gi) is an initial segment o/pathi(g'^), and 
/(pathi(gi)) -/(pathi(gi)) > \pA\‘^\B\K, hg(p 2 ) > \pA\‘^\B\. 

Then there exists an r £ Fy,{X) with |r| < jpal such that pi{r) G Q. 

Proof. Set R = {r £ Fy:{X) \ pi{r) £ T, |r| < \p 2 \, ar"’ s, afr'^' =^* s' for some 
s G Fq{Y)'' and s' G Fq(Y)'' }. Obviously, R is nonvoid. Denote by r an element from 
R with minimal length. We prove that pi{r) £ Q and hg(r) < |pAp|i?|. 

First assume that hg(r) > |p^p|i?|. Then there are 

ri,r2 G .Fs(^ U Hi), r^ £ Fj:{X), mi, m'l, m 2 , m '2 > 0, Si G F^(T U H^J, 
s; G U H^,), S2 G F[^'{Y U H^,), s'^ G U H^,), 

S 3 G Fn{Y)'^U s '3 G Fn(y)'"^ hi £ A'^L b' G A< {i = 1,2) such 


that 

(I) r = ri(r2(r3)), ra / 6, 

(II) arf si(bi.er)> aVf' s;(b;e™i), 

(III) bir^”'^ S 2 (b 2 er), b'lrfi s'2(b'2erh, 

(IV) barg”'^ S3, b'2r3™^ s'3, 

(V) rsP = r2(r3)/3, Bi C B2 and B'^ C B'^, where Bi = {hi. | 1 < j < mj, 

B'i = { 5 '. I 1 < i < m'} (i = 1 , 2 ). 

Take two mappings /: {!,..., mi} —>■ {!,..., m2} and g: {!,..., m}} {!,..., m'2} 

such that 61. = (1 < * < rni) and b}, = b'2^^^^ (1 < i < m-})- Obviously, 

aC si(s3^(^j,... ,S3^(^^j) anda't''' s}(s(5^^^^,..., ), where t = ri(r3). More¬ 

over, ri(r3)/3 = r /3 also holds. Therefore, ri(r3) G R, which is a contradiction since 

ki(?'3)| < \r\. 

Thus, we got that hg(r) < |pAp|iI|. Therefore, for arbitrary vectors s G FfilY)" and 
s' G Fn(V)” satisfying ar” s and a'r"" =>* s', the inequalities hg(si), hg(s'i) < 
|pM|^|iI|iF hold. This, by (iv), obviously implies the conclusion of Lemma 14 . 9.31 □ 
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Lemma 4 . 9.4 Letpi,p2,P3 G F^{X U Si), p4 € F^{X), ni,nl,mi > 0 (i = 1 , 2 , 3 ), 

Qi G Fci(Y U ,^,2j^_|_i), G Fci(Y U ,=,„^^i), ri G Fq(Y U .^mi), 
q2 G U S„,), q' G F^^{Y U S,,), ra G F^^^iY U S^J, 

q3GF^^(yuS„3), q' GF^(yuS,,), G F^ {Y U , 
q 4 G F^(y)-^ q^ G Fo(y)" 3 , r 4 G FniY^^ 
ao,aQ G A', aG A, a* G A"-% a' G A^'i^ b* G (i = 1 , 2 , 3 ). 

Moreover, take an r G Fq{Y), and let r' = ri(r2(r3(r4))). Finally, set Ai = {ui- \ j = 
1,..., Ui}, A[ = {a[. I j = 1,..., n'} and Bi = {b[. | j = 1,..., mj (i = 1, 2, 3). Assume 
that the following conditions are satisfied: 

(i) Pl{P 2 iP 3 {P 4 ))) G T, 

(ii) aopi gi(a6,ai^ii), a'^pi gi(ri(bi^™i),a'i^”i), 

(iii) aip-i q2(a2er). ^iP^^ 

ap2 a^i, r2(b2^™^), 

(iv) a2p^^ q 3 (a 3 er). ^2pf <3(^3^!^, 

aP 3 aCi, b2P™^ r 3 (b 3 ^™ 3 ), 

(v) slspI^ q4, a^p”^ q^, ap 4 =^* r, h3p'fi^ r4, 

(vi) P4fi = P3{Pi)fi = P2{p3{PA))fi, AlC A2C ^3, 

A[ C A'2 C ^'3, .Bi = ^2 C B3, 

(vii) r r', pathi(gi) = pathi(g'3). 

Then at least one of the trees Pi{p2{pa)) , Pi{p3{pa)) and pi{p4) is in Q. 

Proof. First note that the conditions of Lenima l 4 . 9 . 4 l imDlv m G Q. Indeed, 

let t = 91(6,92(93(94))) and t' = 6(?i>9693(94)))• Then 

aoPi{p 2 {P 3 {P 4 ))) t{r), aoPi{p 2 {p 3 {P 4 ))) t'{r') 

and t{r) 7^ t{r'). 

Take six mappings fi: { 1 , ...,nj { 1 ,...,nj+i}, gi: { 1 ,...,n'} { 1 ,... ,and 

hi'. {!,..., nii} {1,..., mj+i} (z = 1, 2) 

such that 

Oi. = (i = 1,2, 1 < j < Ui) o'. = (i = 1,2, 1 < j < n'), 

bi. = (* = 1, 2, l<j< rrn). 
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Furthermore, set /a = /i 0 / 2 , 
notations 

93 = 91 ° 92 and /i 3 = hi 

0 /i 2 . Moreover, introduce the 

Si = (43/j(i), • • • , 

43,^(„3))(q4), s; = 


r+“ 

II 

CO 

^3;,3(„,3))(r4), 


S2 q2(44j-2(i))" 

■ ’^4^2(- 2 ))’ = q2 (44^2(1) 


t2 = r2(r4,,^(3),.. 

’ ^^^2 (’" 2 ) 


S 3 (44j-3(j) ) • • • ) 



ta = (^4^3(1)) • • •; 



Then the following derivations obviously hold: 



«oPi(P 3 (P 4 )) qi{r,si), aoPi(p 3 (p 4 )) 

aoPi{P 2 {P 4 )) 4i(r,S2), aoPi{p 2 {p 4 }) (ri(t2),sQ, 

aoPiiPi) 4i(r,S3), a^pi(p 4 ) (ri(t3),s'3). 

It is also obvious that Pi{p3{p4)), Pi{p2{P4)), PiiPi) G T. 

Now assume that pi[p2{P4)) 0 Q- Then, by (vi) and (vii), mi,m2,m3 > 0 and there 
exists an i (1 < i < m2) such that r3^(r4) 7^ '^4h2{i)' choose hi in such a way that 

for some j {1 < j < mi) hi{j) = i holds. Now assume that, under the latter choice of 
hi, none of Pi{p3{p4)') and pi{p4) are in Q. Then we get n(ti) = ^(ta) = 'f- But this 
is impossible since ti. ^ . □ 

Lemma 4.9.5 Letpi,p 2 ,P 3 G Ft,{X U Si), p^ G Ts(X), ni,n^,mi > 0 (i = 1,2,3), 

qi e Fq(Y u S„^+i), e Fn{Y u S^/^+i), n g Fn{Y u S^J, 

q 2 GF^i(yuS„,), q' GF^(yuS„,), r 2 G (T U S^,), 
q 3 G U S„ 3 ), q' G F^^^Y U S„,), r 3 G F™^(y U S^ 3 ), 

q4 G Fo(y)-^ q^ G Fn{Y)^U G Fn{Y)^^ 
ao,aQ G A', a* G A'^^ a' G A^i, b* G A'^^ {i = 1,2,3). 

Moreover, take an r' G Ffi{Y), and let r = ri (r2(r3(r4))). Finally, set Ai = {a,^. | j = 
1 ,... ,nj}, A[ = {a', \ j = 1, ... ,n'} and B = { 6 *. | j = 1,... ,mi} {i = 1,2,3). Assume 
that the following conditions are satisfied: 

(i) Pl{p 2 {P 3 {P 4 ))) G T, 

(ii) aopi 4i(n(bi^™i),ai^”i), a'^pi q[{r', , 

(hi) aip-i q2(a2^r), a'.p? q' 2 (a' 2 C 4 ^), r2(b2^r). 

(iv) 8 . 2 pT q 3 (a 3 Cr), ^2pf q3(4er^). b2Pr r3(b3C^), 
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(v) q4, =^* r^, 

(vi) P4/3 = P3{P4)P = P2{P3{P4))P, 

A1CA2C As, A[ CA'^C A'3, Bi = B2C Bs, 

(vii) r / r', paths(gi) = path3(gf'3). 

Then at least one of the trees Pi{p2iP4:)), Pi{p3{P4:)) and piipi) is in Q. 

Proof. The proof of this lemma is similar to that of Lemma 14 . 9.41 □ 

Lemma 4 . 9.6 Let 

Pi,P2 € Fj:{X U Hi), P3 G -Fs(X), k,l,m,k', 1 ',m' > 0 , 
qi € Fq{Y U Hfc+i), q[ € Fq{Y U Ek'+i), q2 € Tn(T U H/+i), ^ Fci(Y U 

r€F^{YUEm), F € F^'{Y U E^,), qs^FniYUEi), q'^,r € Fn{Y), 
s € Fn{Yy, s' G Fn{Yy', t G FniY^, t' G FniY^', ao,a'o G A', a, a' G A, 
a G A^, a' G A^', b G A^, b' G A^\ c G A"* and c' G A™'\ 

Moreover, set Ai = {a^ | z = 1 ,..., k}, Bi = {hi \ i = 1 ,... , 1 }, Ci = {ci \ i = 1 ,..., m], 
A3 = {a' I z = 1,..., k'}, i?3 = {63 I z = 1,..., and CJ = {c' | z = 1,... m'}. Assume 
that the following conditions are satisfied: 

(i) Pi{P 2 {P 3 )) € T, 

(ii) aopi gi(aCi,a^f), o'oPi q'i{a'f,i,a'f,f), 

(hi) ap2 q2{aCiM[), a'p2 b'^f), 

apl r(cCr), a'pl' 

(iv) ap3 qsir), a'p3 q'^, hpi^ s, b'ps s', 
cpf t, c'pf t', 

(v) Ai C Si U Cl, A'3 C S( U CJ, P3/3 = P2{P3)$, 

(vi) pathi(g3) = pathi(gi)pathi(g3) and r 93- 
Then Pi{p3) ^Q- 

Proof. Introduce the notation d = (b,c), d' = (b',c'), u = (s,t) and u' = (s',t'). 
Moreover, take two mappings /c} —>■ + m} and g: {l,...,k'{ —)> 

+ m'} satisfying the equalities ai = df(^i) {1 < i < k) and o( = Ug(j) 
(1 < z < k'). Obviously, there are derivations aoPi(p3) qi{q3{'f),Uf(ip ... and 

“oPibs) • ■ ■ ’^g(fe'))- Moreover, pi(p 3 ) e T. Since 

pathi(gi(g 3 (?i), • • •, Uf(k))) = pathi(g'i(Ci, ■ ■ ■, ^^g(fc'))) 
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and q'^ 7 ^ r,qi{q 3 {r), .. ■ ,'U/(fe)) / q[{q'^,u'g^^y .. ■ Hence, ^ 1 (^ 3 ) € Q- ° 

Now we are ready to state a theorem from which the main decidability results of this 
section easily follow. 

Theorem 4.9.7 There exists an algorithm to decide whether Q is empty. 

Proof. Let K denote the maximum of the heights of the right-hand sides of the pro¬ 
ductions from P, ll^ll = and let L be the number of all words over {1,..., rs} with 
length at most ||A|p|i?|iL, where is the maximal m for which 7 ^ 0- Moreover, let 
k = \\A\\^\A\^\B\2L + lj = k + {2\\Af\A\\B\){\\A\\^\B\K + 1) and m = / + 2\\Af\B\. 

We shall show that Q is nonvoid iff it contains a tree with height less than m. The 
case K = 0 being obvious, we assume that K ^ 0. 

Let p be an element of Q with minimal length, and q,q' G Fq{Y) trees such that 
q ^ q' and {p,q), {p,q') G r^- Assume that hg(p) > m. Then there are ao,aQ G A', 
Po,--- ,Pm € FsiX U Hi), pm+i G Ts(A), ni,n' > 0 (i = 0,... ,m), qo G Fq{Y U 
q'o G Fn{Y U H„^), q* G F'^'~\Y U “nj, G {Y U (i = 1,... ,m), q^-^i G 
q^+i G Fn{Y)Ym^ a* G a' G A^t (i = 0,... ,m) such that the following 
conditions are satisfied: 

(1) p = Po{pi{- ■ ■ {Pm+i) ■■■)), Pii^ii (i = 1, ■ ■ ■ ,m), 

(2) g = go(qi(-• • (qm-Hi)• • •))> 4 '= 4o(qi(---(qm+i)•••)), 

(3) aoPo 4o(aoCr)> “oPo 4o(aoC"°), 



(i — 0 , . . . ,m 1 ), ^mPn^l ^ qm-l- 1 ) ^mPn^l ^ qm+l' 

For i = 0, introduce the notations pi = Po{pi{. ■ ■ {pi) ■ ■ ■)), qi = 

4o(qi(- • • (qi) • • •)) and q[ = 5o(q'i(- • • (q() •••))• Moreover, let pi = Pi+i{... {pm+i) ■■■), 
qi = qi+i (... (qm+i)...) and = q'^^ (... (q(„+i)...) (i = 0,..., m). Finally, set 
Ai = {oj. I 1 < j < ni} and A'- = {a'-. | 1 < j < n'} (i = 0,...,m). 

j I'j 

If qi{r) / q'lir') holds for all r G Fq{Y)'^^ and r' G Fq{Y)Yi^ then the fact that 
m — I + 1 > |pAp|i?| makes Lemma 14.9.21 applicable and hence there are i and j with 
I < i < j < m such that pi{pj) G Q. This is obviously a contradiction since \pi[pj)\ < \p\. 

Thus, we may assume that at least one of ni and n[, say ng is greater than 0. Moreover, 
it can also be supposed that there are an p {1 < ii < ni), an r' G Fn{Y U Hi) and an 
s' G Fq{Y) such that q' = r'(s'), pathi(r') = pathj^(g;) and s' 7 ^ Then for each 
j < I, nj > 0. Now let ij {0 < j < I, 1 < ij < nj) be those uniquely determined integers 
for which pathj^. (g^) are initial segments of pathjj(g;). Without loss of generality, we may 
assume that io = ... = ii = 1. 

Now suppose that there exists no rc G {pathj(g() | 1 < i < n)} such that pathi(g;) is an 
initial segment of rc or tc is an initial segment of pathi(gi). Then for each i {I < i < m), 
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set 

Bi = {o-ij I path;^(g/) is an initial segment of pathj(gi)} 

and 

Ci = {ui- I path^(( 7 /) is not an initial segment of pathj(gj)}. 

Since the cardinality of , m} is 2||^|p|i?| + 1, there are ii, ^ 2 , *3 {I < h < i 2 < h ^ 
m) such that the following conditions are satisfied: Bi^ = -Bjj C 

Ci.^ C (7*2 C (7*3 and C A'-^ C From this, by Lemma [4.9.51 we get that at least 
one of the trees pi 2 {Pi 3 )-,Pii{Pi 2 ) and piiipi^) is in Q, which is again a contradiction. 

Therefore, for an ii (!<*;< n^), pathj^(gj) is an initial segment of path^(g;) or 
path;^(gi) is an initial segment of pathj^(g[). Let ij {0 < j < I, 1 < ij < n') be 
those uniquely determined integers for which pathj^. (ql) are initial segments of pathj^. (g^). 
Without loss of generality we may assume that io = ... = ii = 1. We can also assume 
that path^(g/) is an initial segment of path^(g[). 

Now let us distinguish the following two cases: 

a) path^(g^) is an initial segment of path^(g/). If in addition for some i {0 < i < k), 
abs(Z(path^(gj)) — Z(path;^(g'))) > ||T|p|i?|iL then, by Lemma [4.9.31 there exists 
an r G Fq{Y) such that qi{r) G Q and |r| < \pi\. (Here abs stands for absolute 
value.) This obviously is a contradiction. Therefore, for each i {0 < i < k), 
abs(Z(path^(gj)) — Z(path^(g'))) < ||H|p|H|iL. Then, since the cardinality of 
{!,..., A:} is ||H|p|Hp|H|2L + 1, for some integers i and j {I < i < j < k), 
we have: 

(I) path^(gj) is an initial segment of path;^(g'), path^(gj) is an initial segment of 
pathi(gp, pathi(g')/pathi(gi) = pathi(gp/pathi(gj), or 
(II) path^(g') is an initial segment of path;^(gj), path]^(gp is an initial segment 
of path;^(gj), path;^(gj)/path^(g') = path;^(gj)/path]^(gp. (Here uv/u = v for 

any two words u and v.) Moreover, pj/] = pifd, oq = = a'^, Bi C Bj 

and B[ C H', where Bg = | 2 < t < n^} and H' = {a'g^ | 2 < t < 

n'g} {s = i,j). Then, by Lemma [4.9.61 pi{pj) G Q, which is a contradiction 
since \p{pi^)\ < \p\. 

b) path^(g/) is an initial segment of path^(g(,). We shall show that 

Z(pathi(gi)) -Z(pathi(4)) > ||H||^|H|iL. 

Then Z(path^(g^)) — Z(path]^(gfc)) > ||H|p|i?|iL will also hold, which, by Lemma 
14.9.31 will be a contradiction. 

Thus, assume that Z(path]^(gi)) — Z(path^(gfc)) < ||H|p|i?|iL. Then, since the 
cardinality of {k + 1,... ,1} is (2||H|p|H||H|)(||H|p|i?|iL + I), there are ii and 
*2 (k < ii < 12 < 1) such that 12 — h = 2||H||^|H||H| and path;^(gq) = ... = 

pathi(gi 2 ), i.e., g(q+i)i = ... = qi^^ = Ci- Now for each j {h <j< 12 ) set 

Bj = {a'j^ I 1 < t < n'j, path^(g(J is an initial segment of path]^(g')} 
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and 


Cj = {a'j^ I I <t < n'j, path]^(g'^) is not an initial segment of path^(g')}. 

Since the cardinality of {ii,..., ^ 2 } is 2||^||^|A||i?| + 1, there are integers ji ,42 and 
J3 {k < 31 < h < h < k) such that = pjJ = pjJ, 

Aj^ C Aj^ C Aj^, Bj^ = C Bj^ and Cj^ C Cj ^ C Cj^, where Aj, = {aj^^ \ 2 < 
s < rij^} (t = 1, 2, 3). Therefore, by Lemma fd.Q.dl at least one of the trees Pj 2 {pj 3 ), 
PjiiPj 2 ) PjiiPjs) is in Q which is again a contradiction. □ 

Now we are ready to prove 

Theorem 4.9.8 For any two R-transducers 21 = {E,X,A,Il,Y,P,A') and 23 = 
{T,X, B,Il,Y, P', B') and any recognizable TX-forest T it is decidable 

(i) whether t%\T is a (partial) mapping, 

(ii) whether t%\T C r^lT, provided that t<q\T is a (partial) mapping, 

(iii) whether 21 is equivalent to 23, provided that or r® is a (partial) mapping, 
and 

(iv) whether 21 is equivalent to *B, provided that at least one of them is deterministic. 

Proof. By Theorem 14.9.71 (i) is true. Moreover, (iii) and (iv) follow from (ii) since the 
domain of an R-transformation is regular and, by Theorem 12.10.31 it is decidable for two 
regular forests whether one of them contains the other one. Therefore, it is enough to 
prove (ii). 

We may assume that A n B = 0. Let us construct an R-transducer (t = 
{T,X,C,n,Y,P'',C') with C = AU B, C = A' U B' and P" = P U P'. Obviously, 
r^lr = r2i|r U t®|T. Thus t%\T C holds iff dom(r 2 t) fl T C dom(T(8) 0 T and T(r|T 
is a partial mapping. □ 

Before stating the analogous result for F-transducers we prove a lemma. 

Lemma 4.9.9 For any F-transducer 21 = {T, X, A, A,Y, P, A') and R G Rec(S,X) 
one can effectively give an R-transducer 23 = {Ft, X, B, A,Y, P', B') and a forest 
S € Rec(n,X) such that t^\R is a partial mapping iff t^\S is a partial mapping. 

Proof. Construct an R^-transducer 21 = {T,X,A,A,Y,P,A') where P is given as 
follows: 

(i) If X —>■ or (x G X, a G A, r G Fj\{Y)) is in P, then ax ^ r is in P. 

(ii) If ct(oi, ... ,am) -)■ ar (cr G Sm, m > 0, ai,.^,am,a G A, r G Pa(R U S^)) is 
in P, then {aa r{aifi,... ,amim),D) is in P, where P(^j) = dom(T 2 i(ai)) {i = 
l,...,m). Since, by Theorem 14.1.101 fib dom(r 2 i(a)) (a G A) is regular, 21 is an 
Rpt-transducer. Observe that T 2 i(a) U holds for every a G A. 
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We shall show that for all {a, a'} C A and p € Fy,{X) the equivalence 


ka(a)(p) Ura(„/)(p)| > 1 ^ T%a')iP)\ > 1 (1) 

holds. (Note that a and a' are not necessarily distinct.) 

Since T^{^a) ^ '^ 2 r(a)’ (^) implies its right side. 

The converse will be proved by induction on hg(p). If hg(p) = 0, then our statement 
obviously holds. Now let p = a{pi,... ,pm) {o' G Tm,'rn > 0,p G F^{X)) and r,r' G 
Fa(T) be such that ap r, a'{p) r' and r ^ r' . Moreover, assume that the right 
side of (1) implies its left side for every state and every SX-tree of height less than 
hg(p). 

Let us write the above derivations in the form 


acj afpf {i = I,... ,m) 


and 


a'a ..., b'^i'pT • • • > 

where a, a', ai,bi £ A, i = 1,... ,m, ni + ... + = n, n'^ + ... + = n', 

r G F^{Y UH„), r' G Fa{Y UH„/), r(ri,... ,rm) = r and 
r'(ri,... ,rm) = r'. Moreover, (cr(ai,...,a^), 

{a{bi,...,bm), G P. 


Now distinguish the following two cases: 

(I) There exists an i (1 < i < m) with n* > 0 and \'T'^(^g_.'^{pi)\ > 1 or there exists a 
j (1 < J < rn) with n' > 0 and \o-^(^f,.'^{pj)\ > 1. Then, by the induction hypothesis, 

\o<^{ai){Pi)\ > 1 or \T^(bi){Pj)\ > 1- Therefore, by the definition of P, \T<^(a){p)\ > 1 
or |ra(a')(p)| > 1 also holds. 

(II) Assume that there are no i and j satisfying (I). Then, rq = ... = = r* 

(1 < i < m) if nj > 0. For all such i, by T<^(ai) ^ '^a(ai) choice of D, we 

have Pi aiVi. Moreover, again by the choice of ZI, if Uj = 0 then also there 
exists an r* G FaIY) such that pi o^rj holds. Thus, we have the derivation 
p ar. Using similar arguments, one can show that p a'r' is also valid. 
Therefore, \T^(a){p) ^ o^{a')iT)\ > 1- 

Thus, we have proved that t%\R is a partial mapping iff t-^\R is a partial mapping. By 
Theorem l4.4.6h iL there exist a deterministic F-relabeling r; Fs(A) —)• Fci{X) and an R- 
transducer = (11, X, B, A, Y, P", B') such that = ror®. Moreover, by Lemma l4.6.5[ 
Rt = 5 is in Rec(n, A) and S can be obtained effectively from R. Therefore, t-^\R is a 
partial mapping iff rig | S' is a partial mapping. □ 

Now we state and prove 
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Theorem 4.9.10 For any two F-transducers 21 = (E, X, A,D,Y, P, A') and ® = 
{E,X, B,Fl,Y, P', B') and recognizable EX-forest T, it is decidable 

(i) whether t%\T is a partial mapping, 

(ii) whether t%\T C r<B|r, provided that Tgs|r is a partial mapping, 

(iii) whether 21 is equivalent to provided that or rgg is a partial mapping, and 

(iv) whether 21 is equivalent to *B, provided that at least one of them is deterministic. 

Proof. Obviously, (i) follows from Theorem 14.9.81 by Lemma 14.9.91 Moreover, (ii) im¬ 
plies (iii) and (iv) since, by Theorem I4.1.1UI (i), the domain of an F-transformation is 
recognizable. Thus, it suffices to prove (ii). 

Assume that An B = (/i, and construct the F-transducer 

e:= {E,x,c,n,Y,p'',c') 

with C = An B, C = A' U B' and P" = P U P'. Obviously, Tg; = rgi U r®. Therefore, 
t<^\T C T 3 s|r iff dom(r 2 i) O T C dom(T®) fl T and T(i\T is a partial mapping. □ 

4.10 EXERCISES 

1. Define generalized sequential machines as tree transducers when strings are inter¬ 
preted as unary trees in the usual way. 

2. Let T be a DR-transformation. Then dom(T) can be recognized by a DR-recognizer. 

3. Show that the classes CTDT and CVTZ, and similarly the classes CNVF and 
CAfVTZ, are incomparable. 

4. Let us call a DR-transducer 21 = (S, X, A, D, Y, P, A') simple, if for every acr ^ q £ 
P, whenever ai^j and 02^1 occur in q, then oi = 02 . If 21 is a simple DR-transducer, 
then Ta can be induced by an F-transducer. 

5. Prove that DTZ is not closed under composition. 

6. The composition of a totally defined DR-transformation by an R-transformation 
is an R-transformation. 

7. Is TZ closed under composition with LR-transformations from the right? 

8. Show that P is not closed under composition with LNF-transformations from the 
right. 

9. Prove Theorems 14.3.71 and 14.3.91 

10. Find two R-transformations ti and T 2 such that ri o T 2 is the F-transformation 
given in Example 14.1.31 
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11. Give two F-transformations whose composition is the R-transformation of Exam¬ 
ple HRRl 

12. Show that J- and Ttn are incomparable. 

13. Prove that T>TZr is closed under DF-transformations. 

14. An F-transformation (or an R-transformation) is a partial mapping iff it can be 
induced by a DRR-transducer. 

15. Find a DR^-transducer which is not equivalent to any DR-transducer. 

16. The equivalence problem of two RR-transducers is decidable, provided that at least 
one of them induces a partial mapping. 

17. Find an algorithm to decide for an F-transducer whether it is equivalent to an 
LF-transducer. 

18. Let 21 = (S, A, A, Y, P, A') be a GSDT and a ranked alphabet. Let {ni,..., rir} 
be the set of lengths of right-hand sides of all rules from P (each element of AH is 
counted as one symbol). Moreover, let r(n) = {mi,... ,ms}. Assume that there 
exists a mapping /: {ni,..., rir} —)• r(fl) such that the equality 

rik = mf(^k) + himi - 1) + ... + hinis - 1 ) 

holds for every k{= 1,... ,r), where h,... ,ls > 0. Then there is an R-transducer 
25 = {Y,X,B,n,P',B') with = {(p,yd(g)) | {p,q) G r®}. 

19. Find an R-transducer 21 such that preserves recognizability, but 21 is not equiv¬ 
alent to any LF-transducer. 

20. An R-transducer 21 = (S, A, A, 12, T, P, oq) is called k-metalinear if the following 
conditions are satished: 

(1) oq does not appear in the right-hand sides in rules from P, 

(2) for each rule aoa q {a G Em) in P every (1 < i < m) can occur in q at 
most k times, and 

(3) for each rule aa ^ q {a ^ ao, a G S^) in P the number of occurences of each 

(1 < * < m) in O' is 0 or 1. 

Let 21 be a A:-metalinear R-transducer. Does preserve recognizability? 

21. For a ranked alphabet E let E = Sq U Ei be the ranked alphabet with Eg = Eg 
and Yi = {a \ a G E^, m > 0}. Dehne the mapping ph: Ps(A) pPg(A) 

by ph((i) = {d} (d G Eq U A) and 

ph(fT(pi,... ,pm)) = {d{t) I t G ph(pi) U ... U ph(pm)} 

(fj G Em, m > 0, pi, ... ,pm G Ps(A)). Show that if T G Surf(77) then ph(T) = 
U (ph(t) I t G is recognizable. 
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22. Is Surf(7^) closed under intersection? 

23. Give a recursive definition of the concepts of state-sequence and production- 
sequence. 

24. For every F-transducer there is an equivalent totally defined F-transducer with a 
single final state. 

25. For every DF-transducer (DR-transducer) one can effectively give an equivalent 
DF-transducer (DR-transducer) with a minimal number of states. 


4.11 NOTES AND REFERENCES 


The concept of the R-transducer was introduced by Rounds 215( | and Thatcher 238l | 
thus extending generalized sequential machines from strings to trees and to give a tree 
automaton formalism for parts of mathematical linguistics (in particular, f or th e theory 


of syntax directed compilation). The F-transducer is due to Thatcher 239l |. As 


m 


the case of tree recognizers, many of the authors dealing with tree transducers allow a 
symbol from a ranked alphabet to have more than one rank, and most of them use no 
separate frontier alphabets. 


The results of Section |T2] can be found in Engelfriet 7^, and most results of Section 
Ol are also from this work. Theorems 14. 3. 3114. 3. 121 [4. 3. 13l were obtained by Baker ( 2 ^ . 
Tree transducers with regular look-ahead are defined and investigated in Engelfriet 
. Generalized syntax directed translations were introduced by Aho and Ullman Q] 
in the special case where the domain of the translation is the forest of all parse trees of a 
given context-free grammar. (Parse trees are almost the same as our production trees.) 
Applying a generalized syntax directed translation in the sense of Aho and Ullman is 
equivalent to applying a DGSDT of Section 14.51 which, by Theorem 14.5.41 is equivalent 
to applying a DR-transducer and then taking the yield of the resulting tree. The more 
general concept of a GSDT was introduced in Baker 28|]. In the same work she proved 
that for each n, ydSurf(77"') and ydSurf(T'”) are properly contained in the family of 
deterministic context-sensitive languages. 

The results of Section HR] are from Engelfriet Gegseg |in2l | and Rounds 215]. 

The first result about the Surf(77”')-hierarchy can be found in Ogden and Rounds 


190(1 , where they proved that Surf(77) is a proper subclass of Surf(77^) and conjec¬ 


tured the properness of the hierarchy. It was Engelfriet [80|, l83| who succeeded in 
proving that the 77”-, Surf(77”)-, and ydSurf(77”)-hierarchies (and their F-transducer 
counterparts) are proper. Section ITTl and ITHl are based on his work. 

The de cidab ility results of Section f4.9l are from EsiK 90|. Using a different technique 


Zaghar [ 254 ] also proved the decidability of the equivalence problem of DF-transducers. 

As a conclusion we mention some other topics relevant to the subject matter of Chap¬ 
ter 01 

A sequential program machine {sp-machine) introduced by Buda 4^ is such a gen¬ 


eralization of a gsm whose inputs are strings and whose outputs are n-tuples of n-ary 
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4.11 Notes and references 


trees. Buda showed that the equivalence problem of sp-machines is solvable and that 
this implies that the equivalence of certain program schemes is also decidable. 

Engelfriet and File introduced a new type of tree transducer called macro tree trans¬ 
ducer which is a combination of the R-transducer and the context-free tree grammar 
(see Engelfriet 8^). They propose to use macro tree transducers to model attribute 
grammars of D. E. Knuth (Math. Systems Theory 2 (1968), 127-145: Correction: ibid 
5 (1971), 95-96). For tree transformations in terms of magmoi ds we refe r the reader to 
Arnold and Dauchet [H, [3, Dauchet ^, [^, and Lilin isil. Ii6CiI |. 

Finally, we note that much of the category theoretic work mentioned in the Notes and 
References of Chapter [5] deal with tree transductions. 
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root-to-frontier tree transducer, 11361 
with regular look-ahead, 11551 
RR-transducer, 11551 
deterministic, 11551 
linear, 11551 
nondeleting, 11551 
RR-transformation, 11561 
deterministic, 11561 
linear, 11561 
nondeleting, 11561 
R-relabeling, 11431 
R-transducer, 11361 
fc-metalinear, m 
deterministic, 11431 
finite copying, 11781 
fc-copying, 11781 
linear, 11431 
nondeleting, 11431 
totally defined, 11431 
R-transformation, 11361 

set 

free generating, [291 
generating, m 
Parikh, 142] 
power, [5] 
quotient, [5] 

S-algebra, see algebra, [TT] 
fj-catenation, [50] 
cT-product, [69] 

S-term in X, [T7] 

S-tree, [48] 

SX-forest, see also forest, M 
SX-recognizer 
connected, IHTl 
connected DR, 11081 
deterministic root-to-frontier, 1591 

DR, [59] 

frontier-to-root, 152] 
minimal, [82] 
minimal DR, IllOl 
NDF,[56] 

NDR,[57] 
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nondeterministic frontier-to-root, [56] 
nondeterministic root-to-frontier, [571 
quotient, m 
quotient DR, 11071 
reduced, EH 
reduced DR, 11081 
SX-term, [T7I 
SX-tree, 1751 
atomic, \m 

(S, X, A;)-polynomial, [55] 
regular, [90] 
sp-machine, 12041 
state 

copying, 11671 
deleting, 11671 
nondeleting, 11671 
of F-transducer, 11321 
of GSDT, [162] 
of gsm, SI] 

of Mealy machine, [43] 
of NDR SX-recognizer, [ST] 
of recognizer, [27] 
of SX-recognizer, [52] 
state-sequence of 
GSDT. [T78] 

R-transducer, 11781 

structural equivalence of CF grammars, 

[M] 

subalgebra, [l2] 

generated by a set, [12] 
sub derivation in 

F-transducer, [IS] 

R-transducer, 11371 
subrecognizer, [341 
subset 

closed, [T2] 

closed with respect to operation, [TO] 
linear, [42] 
recognizable, 11121 
semilinear, [42] 
subset construction, EH 
substitution, 11131 
subtree, [42] 
supertree, 11931 


supremum, [24] 
surjection, [5] 

syntactic pattern recognition, 11151 
term, [13 

TF-transducer, 11421 
theories, 11141 
TR-transducer, 11431 
transformation induced by 
F-transducer, 11321 
Ru-transducer, 11561 
R-transducer, 11361 
translation, 03] 

elementary, [23] 
induced by GSDT, 11631 
induced by gsm, [44] 
induced by Mealy machine, 03] 
induced by tree transformation, 11311 
tree, 051 

derivation, 
infinite, 11151 
parse, 12041 
production, 11211 
tree transducer 

frontier-to-root, 
macro, 12051 
root-to-frontier, m\ 
tree transformation, 11311 

preserving regularity, 11651 

variable, [13 

word, [23 

accepted by recognizer, [25] 
empty, 1771 
? 7 -accepted, 11271 
proper, 11811 

x-iteration, [68] 
x-path of SX-tree, 11031 
x-quotient, [68l 
x-substitution, [123 
X-language, see language, [271 
X-recognizer, see recognizer, [271 
X-tree, 05] 
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yield of 

forest, 11171 
tree, 11171 

z-product, [65] 

0-state, I1U4I 
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SOME FURTHER TOPICS AND REFERENCES 

by Magnus Steinby 


The purpose of this Appendix is to supplement the original book with notes on some 
further topics and a selection of more recent references. The choice of topics and ref¬ 
erences is partly influenced by personal preferences, but I trust that the areas included 
deserve to be mentioned, and that the general expositions, surveys and research papers 
appearing in the bibliography are useful. Hence, I hope that these notes may serve as 
an initial guide to the subjects discussed, and that they give an idea of the continuing 
vitality of the theory and of its applications. 

Before considering any specific areas, let me note some works of a general nature 
published after Tree Automata was written. J. R. Biichi’s posthumous book Finite Au¬ 
tomata, Their Algebras and Grammars [ll| appeared in 1989 (edited by D. Siefkes). 


The main part of it treats unary algebras, finite acceptors, regular languages and pro¬ 
duction systems, but in a manner that suggests tree automata and tree languages as 
natural generalizations. The last two chapters deal with terms, trees, algebras as tree 
automata, tree grammars, and connections between context-free languages and push¬ 
down automata. Especially this latter part of the book appears quite unfinished, but 
the author’s grand design, a theory that would encompass algebras, automata, formal 
languages and rewriting systems, is clearly discernible. The terminology and notation 
is often nonstandard, sometimes even confusing, but a patient reader is rewarded by 
original insights and interesting historical remarks. 

The book Tree Automata and Languages 6^ edited M. Nivat and A. Podelski, which 
appeared in 1992, is a collection of papers that discuss a variety of topics involving trees. 
The survey paper [i^ by F. Gecseg and M. Steinby may be viewed as a condensed and 
somewhat modernized version of Tree Automata, but it also takes up some further topics 
and its bibliography includes many additional items. 

In their book Syntax-Directed Semantics. Formal Models Based on Tree Transducers 


44l |. Z. Fiilop and H. Vogler consider formal models of syntax-directed semantics based 
on tree transducers. They also develop a fair amount of the general theory of total 
deterministic top-down, macro, attributed, and macro attributed tree transducers. In 
particular, they compare with each other the classes of tree transformations defined 
by the different types of tree transducers, and they present several composition and 
decomposition results for these tree transformations. 

The internet book Tree Automata Techniques and Applications 


1^, to be referred to 


as TATA, is a joint enterprise of several authors. First launched in 1997, it has already 
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been revised and extended a few times. The presentation is often rather informal, but 
the ideas are richly illustrated by examples and many interesting facts are also given as 
exercises. The hrst two chapters review some basic material about finite tree recognizers, 
regular tree languages, and regular tree grammars, but also mention context-free tree 
languages. Chapter 6 contains a brief account of tree transducers (without proofs'). The 
remaining hve chapters deal with topics not covered by our book. The tutorial by 
C. boding focuses on applications of tree automata and emphasizes algorithmic aspects. 

Automata on infinite trees and the connections between tree automata and logic were 
the most important topics excluded from Tree Automata. The two are strongly linked 
with each other and have been studied intensively ever since tree automata were intro¬ 
duced, and by now they form an extensive theory with important applications to logic 
and computer science. Although mainly concerned with the word case, the survey pa¬ 
pers 8l| and 8^ by W. Thomas offer very readable introductions to this area, and they 


also include extensive bibliographies. C hap ter 3 of TATA 1^ is a further useful general 
reference, and some of the papers in 6^ deal with this topic. The book Automata, 
Logics, and Infinite Games edited by E. Gradel, Thomas and T. Wilke contains 
twenty tutorial papers that form an excellent overview of the study of automata, logics 
and games. About half of them concern trees and tree automata. Besides MSO logics, 
they elucidate the uses and properties of various modal logics, fixed-point logics and 
guarded logics, and demonstrate the usefulness of alternating tree automata. 

The continual development of the theory of tree transformations is also largely driven 
by applications, and tree transducers will be mentioned also in connection with some 
the other themes to be discussed below. Here I shall note separately a few important 
topics. The study of compositions of tree transformation classes initiated by B. S. Baker 
(1973, 1979|i] and J. En gelf riet (1975) has been pursued further especially by Fiilop and 
S. Vagvolgyi [d^, ^,43, 3^. In particular, they have considered semigroups of the com¬ 
positions of some given tree transformation classes, and presented rewriting systems by 
which the equality of two composition classes can be decided. They have also considered 
some variants of Engelfriet’s (1977) important idea of regular look-ahead for top-down 
tree transducers ([^, for example). Recently, Engelfriet, S. Maneth and H. Seidl 25] 
have shown that in certain cases it can be decided whether a deterministic top-down 
tree transducer with regular look-ahead is equivalent to a deterministic top-down tree 
transducer, and that such a transducer without look-ahead can be constructed if the 
answer is positive. Macro tree transducers were first defined by Engelfriet (1980) but, as 
noted in 4^ for example, the primitive recursive program schemes independently intro¬ 
duced by B. Courcelle and P. Franchi-Zannettacci [1^ amount to many-sorted versions 
of them. Macro and other hi ghe r-level tree transducers have been studied in depth by 


Engelfriet and Vogler [2y, [271, l29(| (cf. also 2lj, [2^ ). For further information about 

these matters, I recommend the bibliographic notes in 44j. The work @| on equational 
tree transformations by S. Bozapalidis, Fiilop and G. Rahonis is a natural extension of 
a classical theme. 

The decidability of the question whether the image of a given regular tree language 


^The references can be found in the original bibliography of Tree Automata 
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under a given tree homomorphism is regular, has been a relatively long-standing open 
problem, but recently an affirmative solution was presented by G. Godoy and O. Gimenez 
[48|. Their approach uses tree automata with equality or disequality tests, and their work 
contains also some results of independent interest concerning such automata. Moreover, 
it has some applications to term rewriting and XML theory. Fiilop and P. Gyenizse 37l | 
have shown that injectivity is undecidable for tree homomorphisms while it is decidable 
for linear deterministic top-down tree transformations. Furthermore, in 3^ Fiilop proves 
that several questions concerning the ranges of deterministic top-down tree transforma¬ 
tions are undecidable. The decidability of the equivalence of deterministic top-down tree 
transducers was proved by Esik already in 1980. More recently, Engelfriet, Maneth and 
Seidl [2^ showed that the equivalence of total deterministic top-down tree transducers 
can be decided in polynomial time by reducing the transducers to a certain canonical 
form, and their method can be applied also to deterministic top-down tree transducers 
with regular look-ahead. In 3^, S. Eriese, Seidl and Maneth present a corresponding 


equivalence checking algorithm based on normal forms for bottom-up tree transducers. 
In 23|, Engelfriet and Maneth prove that the equivalence of deterministic MSO tree 


transducers is decidable. These results, as well as many other decidability questions for 
tree transducers are discussed in the recent survey paper by Maneth. Einally, two 
quite recent contributions should be mentioned. Firstly, Seidl, Maneth and G. Kemper 
[78| prove the decidability of the equivalence of deterministic top-down tree-to-string 
transducers. In 33|], E. Filiot, Maneth, P.-A. Reynier and J.-M. Talbot introduce tree 
transducers for which every output tree is augmented with information about the origin 
of each of its nodes, and prove several decidability results concerning the equivalence or 
injectivity of such transducers. 

Since terms can be seen as syntactic representations of trees over ranked alphabets, 
it is to be expected that there are some connections between tree automata and term 
rewriting systems (TRSs). Indeed, various tree automata and tree grammars are often 
defined as special term rewriting systems. On the other hand, tree automata can be 
used for solving problems concerning TRSs and such applications have, in turn, inspired 
new developments in the theory tree automata. In the mid-1980s it was noted that the 
set Red(TZ) of terms reducible by a finite left-linear TRS TZ, as well as its complement, 
the set Irr(TZ) of irreducible terms, are regular tree languages. Since this means that 
many questions concerning reducibility and normal forms are decidable for such TRSs, 
the observation was quickly followed by several studies of related matters. Thus it 
was shown that a finite TRS TZ for which Red{TZ) is regular can be “linearized” and 
that the regularity of RediJZ) is decidable, the regular sets Red{TZ) were characterized 
in terms of a new class of finite tree automata, and questions of ground reducibility 
were considered. So-called monadic and semi-monadic TRSs were studied using tree 
pushdown automata. For extending such applications to TRSs that are not left-linear, 
new classes of tree automata are needed. The problem here is that automata that are 
able to recognize also non-regular sets Red{TZ) or the sets of all ground instances of 
a given non-linear term, tend to be too powerful to be manageable themselves. An 
example of increased power combined with good decidability properties is provided by 
the automata with comparisons between brothers introduced in the 1990s. The ground 
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tree transducer is another important tree automaton sprung from the theory of term 
rewriting. Much material concerning these matters can be found in TATA [l^ . and 
introductions to this subject and many references are provided also by the surveys 47l |. 


67l | and [79l |. For some recent work on this theme, cf. 83|, for example. 


Weighted tree automata, tree series and weighted tree transformations have been stud¬ 
ied quite extensively in recent years. Most aspects of this work (up to around 2009) are 
reviewed in the handbook chapter (451] b v Fiilop and Vogler, and a broad introduction 
is provided also by the survey paper [31(1 by Z. Esik and W. Kuich. Weighted logics for 
weighted tree automata have been studied by M. Droste, Vogler and others, cf. 18, 3^, 
for example. Equational weig hted tree transformations are considered by Bozapalidis, 
Eiilop and Rahonis [^. In 71] Rahonis introduces weighted Muller-automata on infinite 
trees and a corresponding weighted MSO-logic. The dissertation 6l| of C. Mathissen 
contains, among other matters, also much interesting material belonging to this area as 
well as a useful bibliography. 

In an unranked tree a node labeled with a given symbol may have any number of 
children. Languages of such trees were considered already in the 1960s in two notable 
papers. J. W. Thatcher (1967) introduced finite unranked tree recognizers and showed 
that the yields of the recognizable unranked tree languages are precisely the context-free 
languages. C. Pair and A. Quere (1968) created an algebraic framework for the study 
of regular unranked tree languages that also incorporated hedges, i.e. finite sequences 
of unranked trees, and they proved many of the usual properties of regular sets for 
recognizable unranked tree languages. Nevertheless, the topic received little attention 
before it was discovered that it is natural to represent XML documents by unranked 
trees and that unranked tree automata may be useful for handling questions concerning 
them. The revival of the theory of unranked tree and hedge languages by M. Murata et al. 

initiated a lively activity in the area. TATA devotes a chapter to unranked 
tree languages and their ^plications. As a sample from the extensive literature, let us 
mention just the papers 0, [i^, [^, 0 and the survey by T. Schwentick. Since this 


work is mostly quite application-oriented, algorithmic and complexity issues are much 
to the fore. X. Piao and K. Salomaa 69|, [70(| have considered state complexity questions 
connected with conversions between different types of unranked automata as well as 
lower bounds for the size of unranked tree automata. An overview of logics for unranked 
trees is given by L. Libkin j^. Weighted unranked tree automata are studied in 19] 
and [l7| by Droste, Vogler, and D. Heusel. 

Natural language description and processing has become an important area of applica¬ 
tion of the theory of tree automata and tree languages. Of course, parse trees of natural 
languages have always been prime examples of ‘trees’ and some of the early works on 
tree automata explicitly refer to linguistic motivations, but the current activity took 
really off much later. In his book [6^ E. Morawietz discusses formalizations of natural 
language syntax that are based on monadic second-order (MSO) logic on trees and tree 
language theory. A key fact here is the effective correspondence between weak MSO logic 
and finite tree automata established already by Thatcher and J. B. Wright (1965, 1968) 
and J. Doner (1965, 1970), but actually a whole array of tree language-related notions 
are utilized or noted as potentially useful. These include tree walking automata 


236 




















211, and tree-adjoining grammars (cf. [^, for a survey). 


macro tree transducers j 
Recently, the theory of tree automata has attracted the attention of linguists especially 
because of the promise shown by tree-based approaches to machine translation. Besides 
classical notions and results appearing already in our book, work in this area draws 
also upon some newer developments. In particular, it has both utilized and inspired 
work on unranked and weighted tree languages as well as weighted tree transducers. 
Furthermore, it has revived the interest in the generalized top-down tree transducers 
studied much earlier by E. Lilin (1978). Also compositions and decompositions of vari¬ 


ous tree transformations are used in machine translation systems. The papers 15,81 1 


expose some of the relevant questions from a linguist’s point of view, while the papers 
[^ . 49, ^,[ 5 ^ form a sample of theoretical work in the area. 


Almost all papers on varieties of tree languages, and classes of special regular tree 
languages in general, have appeared after 1984. Most of the work in this area published 
before 2005 is at least mentioned in the survey 80(], and all the references pointed to 


(by author and year) below can be found there. Eilenberg-like variety theories for tree 
languages were presented by Steinby (1979, 1992, 1998) and J. Almeida (1990, 1995). 
Esik (1999) has set forth a variety theory in which Unitary algebraic theories take the 
place of finite algebras, and later he together with P. Weil [3^ formulated a similar theory 
in terms of preclones. Syntactic monoids of tree languages were introduced by Thomas 
(1982, 1984) and studied further by Salomaa (1983). A similar notion for binary trees 
has been used by Nivat and Podelski (1989, 1992). The families of regular tree languages 
considered in the literature include those of the finite and co-finite tree languages (Gecseg 
and B. Imreh 1988), definite, reverse definite and generalized definite tree languages 
(U. Heuter 1989, 1992), ^-testable tree languages (Heuter 1989, T. Knuutila 1992), and 
aperiodic tree languages (Thomas 1984). All of them are varieties of tree languages (cf. 
Steinby 1992, 1998), and in some cases the corresponding varieties of finite algebras are 
also known. 

Although Thomas (1984) could characterize the aperiodic tree languages by their syn¬ 
tactic monoids, it was obvious that such a characterization is not possible for all varieties 
of tree languages. This was confirmed when S. Salehi described the (generalized) va¬ 
rieties definable by syntactic monoids or semigroups. His result shows, for example, that 
the definite tree languages cannot be characterized by syntactic semigroups (as claimed 
in an earlier paper). However, in 1^ A. Cano Gomez and Steinby introduce generalized 
syntactic semigroups (and monoids) in terms of which the definite tree languages can 
be characterized. Wilke (1996) gave an effective characterization of the reverse definite 
binary tree languages in terms of so-called tree algebras. Salehi and Steinby [7^ stud¬ 
ied the tree algebra formalism in some detail and presented a variety theorem for it. 
Noticing that the well-known equivalence of aperiodicity, star-freeness, and first-order 
definability of string languages fails for trees, Thomas (1984) introduced logics in which 
set quantifications are limited to chains or to antichains of nodes. He proved then, for 
example, that a regular tree language is star-free iff it is antichain-definable. This line of 
research has been pursued further by Heuter (1989, 1991) and A. Potthoff (1994, 1995), 
for example. 

Some families of tree languages have been introduced by first defining a class of finite 
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algebras. For example, the monotone tree languages studied by Gecseg and Imreh (2002) 
were defined as the languages recognized by monotone algebras. Similarly, Esik and 
Sz. Ivan 30(] introduce a hierarchy of aperiodicity notions for finite algebras and consider 
then, besides the properties of the obtained varieties of finite algebras, the corresponding 
families of tree languages. There are a few different extensions of the variety th eory 
of tree languages: positive varieties of tree languages by T. Petkovic and Salehi [68l |. 
varieties of many sorted sets (with tree languages as a special case) by Salehi and Steinby 


and varieties of recognizable tree series by Fiilop and Steinby |38l ]. 

A section of Tree Automata is devoted to deterministic root-to-frontier (DR) recogniz¬ 
ers and DR tree languages, but the topic has been studied quite extensively also later. In 
her thesis E. Jurvanen (1995) considers closure properties and the variety generated by 
DR tree languages as well as ways of strengthening DR recognizers. The latter include, 
in particular, the regular frontier check mechanism introduced by Jurvanen, Potthoff and 
Thomas (1994). The thesis is also a good general introduction and a reference for work 
done before 1995. In the synchronized deterministic top-down automata of Salomaa 
[ 75 I . [tgI a limited communication between the computations in different branches is al¬ 
lowed. Gecseg and Steinby (2001) introduced syntactic monoids for DR tree languages, 
and these were used by Gecseg and Imreh (2002, 2004) for characterizing monotone, 
nilpotent and definite DR tree languages. In W. Martens, F. Neven and Schwentick 
discuss several aspects of DR-recognition. In particular, motivated by applications to 
schema languages for XML, they study DR recognizers of unranked tree languages. 

The book Grammatical Picture Generation. A Tree-Based Approach 0 by F. Drewes 
is a comprehensive treatment of tree-based picture generation. The picture generating 
systems considered consist, roughly speaking, of a device for producing a tree language 
and a picture algebra that interprets trees as pictures. The devices used for producing 
the tree languages include regular tree grammars, ETOL tree grammars, branching tree 
grammars, and tree transducers. The needed tree language theory is given in several 
inserts in the main text and in a separate appendix. Thus this fascinating book offers 
also a general introduction to tree languages. 

A great number of concepts and results from several branches of mathematics are 
used in the theory of tree automata. However, as a conclusion of this appendix, I shall 
mention some introductions to just two subjects most intimately connected with tree 
automata: universal algebra and term rewriting. Besides the texts listed at the end of 
Ghapter I of Tree Automata, there are several other good books on universal algebra. As 
general introductions, I recommend the classicby S. Burris and H. P. Sankappanavar 
and the more recent textbook by C. Bergman [3|. The book 8^ W W. Wechler, written 
expressly for computer scientists, is also very useful. The books [l[ by J. Avenhaus and 
[2] by F. Baader and T. Nipkow offer two good introductions to term rewriting systems. 
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